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ADENO-ASSOCIATED VIRUS SEROTYPE 1 NUCLEIC ACID 
SEQUENCES, VECTORS AND HOST CELLS CONTAINING SAME 

This work was supported by the National Institutes of Health, grant no. P30 
DK47757-06 and POl HD32649-04. The US government may have certain rights in 
5 this invention. 

Field of the Invention 

This invention relates generally to viral vector, and more particularly, to 
recombinant viral vectors useful for gene delivery. 

Background of the Invention 

10 Adeno-associated viruses are small, single-stranded DNA viruses which 

require helper virus to facilitate efficient replication [K.I. Berns, Parvoviridqe: the 
viruses and their replication, p. 1007-1041, in F.N. Fields et al., Fundamental 
virology , 3rd ed,, vol. 2, (Lippencott-Raven Publishers, Philadelphia, PA) (1995)]. 
The 4.7 kb genome of AAV is characterized by two inverted terminal repeats (ITR) 

15 and two open reading frames which encode the Rep proteins and Cap proteins, 

respectively. The Rep reading frame encodes four proteins of molecular weight 78 
kD, 68 kD, 52 kD and 40 kD. These proteins function mainly in regulating AAV 
replication and integration of the AAV into a host cell's chromosomes. The Cap 
reading frame encodes three structural proteins in molecular weight 85 kD (VP 1), 72 

20 kD (VP2) and 61 kD (VP3) [Berns, cited above]. More than 80% of total proteins in 
AAV virion comprise VP3. The two ITRs are the only cis elements essential for AAV 
replication, packaging and integration. There are two conformations of AAV ITRs 
called "flip" and "flop". These differences in conformation originated from the 
replication model of adeno-associated virus which use the ITR to initiate and reinitiate 

25 the replication [R.O. Snyder et al., J. Virol. . 67:6096-6104 (1993); K.I. Berns, 
Microbiological Reviews . 54:316-329 (1990)]. 

AAVs have been found in many animal species, including primates, canine, 
fowl and human [F. A. Murphy et al., "The Classification and Nomenclature of 
Viruses: Sixth Report of the International Committee on Taxonomy of Viruses", 
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Archives of Virology . (Springer-Verlag. Vienna) (1995)], In addition to five known , 
primate AAVs (AAV-1 to AAV-5), AAV-6, another serotype closely related to 
AAV-2 and AAV-1 has also been isolated [E. A. Rutledge et al., J. Virol. . 72:309- 
319 (1998)]. Among all known AAV serotypes, AAV-2 is perhaps the most well- 
5 characterized serotype, because its infectious clone was the first made [R.J. Samulski 
et al., Proc. Natl. Acad. Sci. USA . 79:2077-2081 (1982)]. Subsequently, the full 
sequences for AAV-3 A, AAV-3B, AAV-4 and AAV-6 have also been determined 
[Rutledge, cited above; J.A.Chiorini et al., J. Virol. . 71:6823-6833 (1997); S. 
Muramatsu et al, Virol. . 221:208-217(1996)]. Generally, all AAVs share more than 

10 80% homology in nucleotide sequence. 

A number of unique properties make AAV a promising vector for human gene 
therapy [Muzyczka, Current Topics in Microbiology and Immunology, 158:97-129 
(1992)]. Unlike other viral vectors, AAVs have not been shown to be associated with 
any known human disease and are generally not considered pathogenic. Wild type 

15 AAV is capable of integrating into host chromosomes in a site specific manner [R. M. 
Kotin et al., Proc, Natl. Acad. Sci, USA . 87:221 1-2215 (1990)- R.J. Samulski, 
EMBO J., K)(12):3941-3950 (1991)]. Recombinant AAV vectors can integrate into 
tissue cultured cells in chromosome 19 if the rep proteins are supplied in trans [C. 
Balague et al., J. Virol. , 71:3299-3306 (1997); R. T. Surosky et al, J. Virol , 

20 71:7951-7959 (1997)]. The integrated genomes of AAV have been shown to allow 
long term gene expression in a number of tissues, including, muscle, liver, and brain 
[K. J. Fisher, Nature Med. , 3(3):306-312 (1997); R. 0. Snyder et al, Nature 
Genetics , J6:270-276 (1997); X. Xiao et al., Experimental Neurology . 144:113-124 
(1997); Xiao, J. Virol. , 70(11):8098-8108 (1996)]. 

25 AAV-2 has been shown to be present in about 80-90% of the human 

population. Earlier studies showed that neutralizing antibodies for AAV-2 are 
prevalent [W. P. Parks et al., J. Virol. . 2:716-722 (1970)]. The presence of such 
antibodies may significantly decrease the usefulness of AAV vectors based on AAV-2 
despite its other merits. What are needed in the art are vectors characterized by the 
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advantages of AAV-2, including those described above, without the disadvantages, 
including the presence of neutralizing antibodies. 



Summary of the Invention 

In one aspect, the invention provides an isolated AAV-l nucleic acid molecule 
5 which is selected from among SEQ ID NO: 1, the strand complementary to SEQ ID 
NO: 1, and cDNA and RNA sequences complementary to SEQ ID NO: 1 and its 
complementary strand. 

In another aspect, the present invention provides AAV ITR sequences, which 
include the 5' ITR sequences, nt 1 to 143 of SEQ ID NO: 1; the 3* ITR sequences, nt 
10 4576 to 4718 of SEQ ID NO: 1, and fragments thereof. 

In yet another aspect, the present invention provides a recombinant vector 
comprising an AAV-l ITR and a selected transgene. Preferably, the vector comprises 
both the 5 ! and 3' AAV-l ITRs between which the selected transgene is located. 

In still another aspect, the invention provides a recombinant vector comprising 
15 an AAV-l P5 promoter having the sequence of nt 236 to 299 of SEQ ID NO: 1 or a 
functional fragment thereof. 

In a further aspect, the present invention provides a nucleic acid molecule 
encoding an AAV-l rep coding region and an AAV-l cap coding region. 
In still another aspect, the present invention provides a host cell transduced with a 
20 recombinant viral vector of the invention. The invention further provides a host cell 
stably transduced with an AAV-l P5 promoter of the invention. 

In still a further aspect, the present invention provides a pharmaceutical 
composition comprising a carrier and a vector of the invention. 

In yet another aspect, the present invention provides a method for AAV— 
25 mediated delivery of a transgene to a host involving the step of delivering to a selected 
host a recombinant viral vector comprising a selected transgene under the control of 
sequences which direct expression thereof and an adeno-associated virus 1 (AAV-l) 
virion. 
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In another aspect, the invention provides a method for in vitro production of a 
selected gene product using a vector of the invention. 

Other aspects and advantages of the invention will be readily apparent to one 
of skill in the art from the detailed description of the invention. 

Brief Description of the Drawings 

Figs. 1 A-1C illustrate the alignment of nucleotides of AAV-1 [SEQ ID NO: 
1], AAV-2 [SEQ ID NO: 18] and AAV-6 [SEQ ID NO: 19]. The alignment was 
done with MacVector 6.0. The full sequences of AAV-1 are shown in the top line. 
Nucleotides in AAV-2 and AAV-6 identical to AAV-1 are symbolized by "." and gaps 
by Some of the conserved features among AAVs are marked in this figure. Note 
the 3' ITRs of AAV-1 and AAV-6 are shown in different orientations. 

Fig. 2 illustrates the predicted secondary structure of AAV-1 ITR The 
nucleotides in AAV-2 and AAV-6 are shown in italic and bold respectively. 

Fig. 3 A illustrates a hypothesis of how AAV-6 arose from the homologous 
recombination between AAV-1 and AAV-2. The major elements of AAV-1 are 
indicated in the graph. A region that is shared between AAV-1, AAV-2 and AAV-6 
is shown in box with waved lines. 

Fig. 3B is a detailed illustration of a 71 bp homologous region among AAV-1, 
AAV-2 and AAV-6. Nucleotides that differ among these serotypes are indicated by 
arrows. - 

Fig. 4 A is a bar chart illustrating expression levels of human alpha 1 anti- 
trypsin (al AT) in serum following delivery of hAAT via recombinant AAV-1 and 
recombinant AAV-2 viruses. 

Fig. 4B is a bar chart illustrating expression levels of erythropoietin (epo) in 
serum following delivery of the epo gene via recombinant AAV-1 and recombinant 
AAV-2 viruses. 

Fig. 5 A is a bar chart illustrating expression levels of al AT in liver following 
delivery of a 1 AT as described in Example 7. 
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Fig. 5B is a bar chart demonstrating expression levels of epo in liver following 
delivery of epo as described in Example 7. 

Fig. 5C is a bar chart demonstrating neutralizing antibodies (NAB) directed to 
AAV-1 following delivery of al AT or epo to liver as described in Example 7. 
5 Fig. 5D is a bar chart demonstrating neutralizing antibodies (NAB) directed to 

AAV-2 following delivery of al AT or epo to liver as described in Example 7. 

Fig. 6A is a bar chart illustrating expression levels of al AT in muscle 
following delivery of al AT as described in Example 7. 

Fig. 6B is a bar chart demonstrating expression levels of epo in muscle 
10 following delivery of epo as described in Example 7. 

Fig. 6C is a bar chart demonstrating neutralizing antibodies (NAB) directed to 
AAV-1 following delivery of al AT or epo to muscle as described in Example 7. 

Fig. 6D is a bar chart demonstrating neutralizing antibodies (NAB) directed to 
AAV-2 following delivery of al AT or epo to muscle as described in Example 7. 

15 Detailed Description of the Invention 

The present invention provides novel nucleic acid sequences for an adeno— 
associated virus of serotype 1 (AAV-1). Also provided are fragments of these AAV-1 
sequences. Among particularly desirable AAV-1 fragments are the inverted terminal 
repeat sequences (lTRs), rep and cap. Each of these fragments may be readily 

20 utilized, e.g., as a cassette, in a variety of vector systems and host cells. Such 
fragments may be used alone, in combination with other AAV-1 sequences or 
fragments, or in combination with elements from other AAV or non-AAV viral 
sequences. In one particularly desirable embodiment, a cassette may contain the 
AAV-1 ITRs of the invention flanking a selected transgene. In another desirable 

25 embodiment, a cassette may contain the AAV-1 rep and/or cap proteins, e.g., for use 
in producing recombinant (rAAV) virus. 

Thus, the AAV-1 sequences and fragments thereof are useful in production of 
rAAV, and are also useful as antisense delivery vectors, gene therapy vectors, or 
vaccine vectors. The invention further provides nucleic acid molecules, gene delivery 



ft 
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vectors, and host cells which contain the AAV-l sequences of the invention. Also 
provided a novel methods of gene delivery using AAV vectors. 

As described herein, the vectors of the invention containing the AAV-l capsid 
proteins of the invention are particularly well suited for use in applications in which 
5 the neutralizing antibodies diminish the effectiveness of other AAV serotype based 
vectors, as well as other viral vectors. The rAAV vectors of the invention are 
particularly advantageous in rAAV readministration and repeat gene therapy. 

These and other embodiments and advantages of the invention are described in 
more detail below. As used throughout this specification and the claims, the term 
10 "comprising" is inclusive of other components, elements, integers, steps and the like. 

1. AAV-l NUCLEIC ACID AND PROTEIN SEQUENCES 

The AAV-l nucleic acid sequences of the invention include the DNA 
sequences of SEQ ID NO: 1 (Figs. 1 A-1C), which consists of 4718 nucleotides. The 
AAV-l nucleic acid sequences of the invention further encompass the strand which is 

15 complementary to SEQ ID NO: 1, as well as the RNA and cDNA sequences 

corresponding to SEQ ID NO: 1 and its complementary strand. Also included in the 
nucleic acid sequences of the invention are natural variants and engineered 
modifications of SEQ ID NO: 1 and its complementary strand. Such modifications 
include, for example, labels which are known in the art, methylation, and substitution 

20 of one or more of the naturally occurring nucleotides with an analog. 

Further included in this invention are nucleic acid sequences which are greater 
than 85%, preferably at least about 90%, more preferably at least about 95%, and 
most preferably at least about 98 - 99% identical or homologous to SEQ ID NO:l. 
The term "percent sequence identity" or "identical" in the context of nucleic acid 

25 sequences refers to the residues in the two sequences which are the same when 

aligned for maximum correspondence. The length of sequence identity comparison 
may be over the full-length sequence, or a fragment at least about nine nucleotides, 
usually at least about 20 - 24 nucleotides, at least about 28 - 32 nucleotides, and 
preferably at least about 36 or more nucleotides. There are a number of different 
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algorithms known in the art which can be used to measure nucleotide sequence 
identity. For instance, polynucleotide sequences can be compared using Fasta, a 
program in GCG Version 6. 1 . Fasta provides alignments and percent sequence 
identity of the regions of the best overlap between the query and search sequences 
5 (Pearson, 1990, herein incorporated by reference). For instance, percent sequence 
identity between nucleic acid sequences can be determined using Fasta with its default 
parameters (a word size of 6 and the NOP AM factor for the scoring matrix) as 
provided in GCG Version 6.1, herein incorporated by reference. 

The term "substantial homology" or "substantial similarity," when referring to 
10 a nucleic acid or fragment thereof, indicates that, when optimally aligned with 
appropriate nucleotide insertions or deletions with another nucleic acid (or its 
complementary strand), there is nucleotide sequence identity in at least about 95 - 
99% of the sequence. 

Also included within the invention are fragments of SEQ ID NO: 1, its 
15 complementary strand, cDNA and RNA complementary thereto. Suitable fragments 
are at least 1 5 nucleotides in length, and encompass functional fragments which are of 
biological interest. Certain of these fragments may be identified by reference to Figs. 
1 A-1C. Examples of particularly desirable functional fragments include the AAV-1 
inverted terminal repeat (ITR) sequences of the invention. In contrast to the 145 nt 
20 ITRs of AAV-2, AAV-3, and AAV-4, the AAV-1 ITRs have been found to consist of 
only 143 nucleotides, yet advantageously are characterized by the T-shaped hairpin 
structure which is believed to be responsible for the ability of the AAV-2 ITRs to 
direct site-specific integration. In addition, AAV-1 is unique among other AAV 
serotypes, in that the 5' and 3' ITRs are identical. The full-length 5' ITR sequences of 
25 AAV-1 are provided at nucleotides 1-143 of SEQ ID NO: 1 (Fig. 1 A) and the full- 
length 3* ITR sequences of AAV-1 are provided at nt 4576-4718 of SEQ ID NO: 1 
(Fig. 1C). One of skill in the art can readily utilize less than the full-length 5' and/or 3* 
ITR sequences for various purposes and may construct modified ITRs using 
conventional techniques, e.g., as described for AAV-2 ITRs in Samulski et al, Cell . 
30 33:135-143 (1983). 
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Another desirable functional fragment of the AAV-1 genome is the P5 
promoter of AAV-1 which has sequences unique among AAV P5 promoters, while 
maintaining critical regulatory elements and functions. This promoter is located 
within nt 236 - 299 of SEQ ID NO: 1 (Fig. 1 A). Other examples of functional 
5 fragments of interest include the sequences at the junction of the rep/cap, e.g., the 
sequences spanning nt 2306-2223, as well as larger fragments which encompass this 
junction which may comprise 50 nucleotides on either side of this junction. Still other 
examples of functional fragments include the sequences encoding the rep proteins. 
Rep 78 is located in the region of nt 334 - 2306 of SEQ ID NO: 1; Rep 68 is located 

10 in the region of nt 334-2272, and contains an intron spanning nt 1924-2220 of SEQ 
ID NO: 1. Rep 52 is located in the region of nt 1007 - 2304 of SEQ ID NO: 1; rep 40 
is located in the region of nt 1007 - 2272, and contains an intron spanning nt 1924- 
2246 of SEQ ID NO: 1. Also of interest are the sequences encoding the capsid 
proteins, VP 1 [nt 2223-4431 of SEQ ID NO: 1], VP2 [nt 2634-4432 of SEQ ID NO: 

15 1] and VP3 [nt 2829-4432 of SEQ ID NO: 1]. Other fragments of interest may 

include the AAV-1 PI 9 sequences, AAV-1 P40 sequences, the rep binding site, and 
the terminal resolute site (TRS). 

The invention further provides the proteins and fragments thereof which are 
encoded by the AAV-1 nucleic acids of the invention. Particularly desirable proteins 

20 include the rep and cap proteins, which are encoded by the nucleotide sequences 
identified above. These proteins include rep 78 [SEQ ID NO:5], rep 68 [SEQ ID 
NO:7], rep 52 [SEQ ID NO:9], rep 40 [SEQ ID NO: 1 1], vpl [SEQ ID NO: 13], vp2 
[SEQ ID NO: 15], and vp3 [SEQ IID NO: 17] and functional fragments thereof while 
the sequences of the rep and cap proteins have been found to be closely related to 

25 those of AAV-6, there are differences in the amino acid sequences (see Table 1 
below), as well as differences in the recognition of these proteins by the immune 
system. However, one of skill in the art may readily select other suitable proteins or 
protein fragments of biological interest. Suitably, such fragments are at least 8 amino 
acids in length. However, fragments of other desired lengths may be readily utilized. 
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Such fragments may be produced recombinantly or by other suitable means, e.g., 
chemical synthesis. 

The sequences, proteins, and fragments of the invention may be produced by 
any suitable means, including recombinant production, chemical synthesis, or other 
synthetic means. Such production methods are within the knowledge of those of skill 
in the art and are not a limitation of the present invention. 

11. VIRAL VECTORS 

In another aspect, the present invention provides vectors which utilize the 
AAV-1 sequences of the invention, including fragments thereof, for delivery of a 
heterologous gene or other nucleic acid sequences to a target cell. Suitably, these 
heterologous sequences (i.e., a transgene) encode a protein or gene product which is 
capable of being expressed in the target cell. Such a transgene may be constructed in 
the form of a "minigene". Such a "minigene" includes selected heterologous gene 
sequences and the other regulatory elements necessary to transcribe the gene and 
express the gene product in a host cell. Thus, the gene sequences are operatively 
linked to regulatory components in a manner which permit their transcription. Such 
components include conventional regulatory elements necessary to drive expression of 
the transgene in a cell containing the viral vector. The minigene may also contain a 
selected promoter which is linked to the transgene and located, with other regulatory 
elements, within the selected viral sequences of the recombinant vector. 

Selection of the promoter is a routine matter and is not a limitation of this 
invention. Useful promoters may be constitutive promoters or regulated (inducible) 
promoters, which will enable control of the timing and amount of the transgene to be 
expressed. For example, desirable promoters include the cytomegalovirus (CMV) 
immediate early promoter/enhancer [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], 
the Rous sarcoma virus LTR promoter/enhancer, and the chicken cytoplasmic p-actin 
promoter [T. A. Kost et al, Nucl. Acids Res. . ii(23):8287 (1983)]. Still other 
desirable promoters are the albumin promoter and an AAV P5 promoter. Optionally, 
the selected promoter is used in conjunction with a heterologous enhancer, e.g., the (3- 
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actin promoter may be used in conjunction with the CMV enhancer. Yet other 
suitable or desirable promoters and enhancers may be selected by one of skill in the 
art. 

The minigene may also desirably contain nucleic acid sequences heterologous 
5 to the viral vector sequences including sequences providing signals required for 

efficient polyadenylation of the transcript (poly-A or pA) and introns with functional 
splice donor and acceptor sites. A common poly-A sequence which is employed in 
the exemplary vectors of this invention is that derived from the papovavirus SV-40. 
The poly-A sequence generally is inserted in the minigene downstream of the 

10 transgene sequences and upstream of the viral vector sequences. A common intron 
sequence is also derived from SV-40, and is referred to as the SV40 T intron 
sequence. A minigene of the present invention may also contain such an intron, 
desirably located between the promoter/enhancer sequence and the transgene. 
Selection of these and other common vector elements are conventional [see, e.g., 

15 Sambrook et al, "Molecular Cloning. A Laboratory Manual", 2d edit., Cold Spring 
Harbor Laboratory, New York (1989) and references cited therein] and many such 
sequences are available from commercial and industrial sources as well as from 
Genebank. 

The selection of the transgene is not a limitation of the present invention. 

20 Suitable transgenes may be readily selected from among desirable reporter genes, 
therapeutic genes, and optionally, genes encoding immunogenic polypeptides. 
Examples of suitable reporter genes include p-galactosidase (p-gal), an alkaline 
phosphatase gene, and green fluorescent protein (GFP). Examples of therapeutic 
genes include, cytokines, growth factors, hormones, and differentiation factors, 

25 among others. The transgene may be readily selected by one of skill in the art. See, 
e.g., WO 98/09657, which identifies other suitable transgenes. 

Suitably, the vectors of the invention contain, at a minimum, cassettes which 
consist of fragments of the AAV-1 sequences and proteins. In one embodiment, a 
vector of the invention comprises a selected transgene, which is flanked by a 5' ITR 

30 and a 3' ITR, at least one of which is an AAV-1 ITR of the invention. Suitably, 
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vectors of the invention may contain a AAV-l P5 promoter of the invention. In yet 
another embodiment, a plasmid or vector of the invention contains AAV-l rep 
sequences. In still another embodiment, a plasmid or vector of the invention contains 
at least one of the AAV-l cap proteins of the invention. Most suitably, these AAV-1- 
derived vectors are assembled into viral vectors, as described herein. 
A. AAV Viral Vectors 

In one aspect, the present invention provides a recombinant AAV-l 
viral vector produced using the AAV-l capsid proteins of the invention. The 
packaged rAAV-1 virions of the invention may contain, in addition to a selected 
minigene, other AAV-l sequences, or may contain sequences from other AAV 
serotypes. 

Methods of generating rAAV virions are well known and the selection 
of a suitable method is not a limitation on the present invention. See, e.g., K. Fisher 
et al, J. Virol. . 70:520-532 (1993) and US Patent 5,478,745. In one suitable method, 
a selected host cell is provided with the AAV sequence encoding a rep protein, the 
gene encoding the AAV cap protein and with the sequences for packaging and 
subsequent delivery. Desirably, the method utilizes the sequences encoding the AAV- 
1 rep and/or cap proteins of the invention. 

In one embodiment, the rep/cap genes and the sequences for delivery 
are supplied by co-transfection of vectors carrying these genes and sequences. In one 
currently preferred embodiment, a cis (vector) plasmid, a trans plasmid containing the 
rep and cap genes, and a plasmid containing the adenovirus helper genes are co— 
transfected into a suitable cell line, e.g., 293. Alternatively, one or more of these 
functions may be provided in trans via separate vectors, or may be found in a suitably 
engineered packaging cell line. 

An exemplary cis plasmid will contain, in 5 ! to 3' order, AAV 5' ITR, 
the selected transgene, and AAV 3' ITR. In one desirable embodiment, at least one of 
the AAV ITRs is a 143 nt AAV-l ITR. However, other AAV serotype ITRs may be 
readily selected. Suitably, the full-length ITRs are utilized. However, one of skill in 
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the art can readily prepare modified AAV.ITRs using conventional techniques. 
Similarly, methods for construction of such plasmids is well known to those of skill in 
the art. 

A trans plasmid for use in the production of the rAAV-1 virion particle 
may be prepared according to known techniques. In one desired embodiment, this 
plasmid contains the rep and cap proteins of AAV-1, or functional fragments thereof. 
Alternatively, the rep sequences may be from another selected AAV serotype. 

The cis and trans plasmid may then be co-transfected with a wild-type 
helper virus (e.g., Ad2, Ad5, or a herpesvirus), or more desirably, a replication - 
defective adenovirus, into a selected host cell. Alternatively, the cis and trans plasmid 
may be co-transfected into a selected host cell together with a transfected plasmid 
which provides the necessary helper functions. Selection of a suitable host cell is well 
within the skill of those in the art and include such mammalian cells as 293 cells, HeLa. 
cells, among others. 

Alternatively, the cis plasmid and, optionally the trans plasmid, may be 
transfected into a packaging cell line which provides the remaining helper functions 
necessary for production of a rAAV containing the desired AAV-1 sequences of the 
invention. An example of a suitable packaging cell line, where an AAV-2 capsid is 
desired, is B-50, which stably expresses AAV-2 rep and cap genes under the control 
of a homologous P5 promoter. This cell line is characterized by integration into the 
cellular chromosome of multiple copies (at least 5 copies) of P5-rep-cap gene 
cassettes in a concatomer form. This B-50 cell line was deposited with the American 
Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 201 10- 
2209, on September 18, 1997 under Accession No. CRL- 12401 pursuant to the 
provisions of the Budapest Treaty. However, the present invention is not limited as to 
the selection of the packaging cell line. 

Exemplary transducing vectors based on AAV-1 capsid proteins have 
been tested both in vivo and in vitro, as described in more detail in Example 4. In 
these studies, it was demonstrated that recombinant AAV vector with an AAV-1 
virion can transduce both mouse liver and muscle. These, and other AAV-1 based 
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gene therapy vectors which may be generated by one of skill in the art are beneficial 
for gene delivery to selected host cells and gene therapy patients since the 
neutralization antibodies of AAV- 1 present in much of the human population exhibit 
different patterns from other AAV serotypes and therefore do not neutralize the 
5 AAV-1 virions. One of skill in the art may readily prepare other rAAV viral vectors 
containing the AAV-1 capsid proteins provided herein using a variety of techniques 
known to those of skill in the art. One may similarly prepare still other rAAV viral 
vectors containing AAV-1 sequence and AAV capsids of another serotype. 
B. Other Viral Vectors 

10 One of skill in the art will readily understand that the AAV-1 

sequences of the invention can be readily adapted for use in these and other viral 
vector systems for in vitro, ex vivo or in vivo gene delivery. Particularly well suited 
for use in such viral vector systems are the AAV-1 ITR sequences, the AAV-1 rep, 
the AAV-1 cap, and the AAV-1 P5 promoter sequences. 

15 For example, in one desirable embodiment, the AAV-1 ITR sequences 

of the invention may be used in an expression cassette which includes AAV-1 5' ITR, 
a non-AAV DNA sequences of interest (e.g., a minigene), and 3' ITR and which lacks 
functional rep/cap. Such a cassette containing an AAV-1 ITR may be located on a 
plasmid for subsequent transfection into a desired host cell, such as the cis plasmid 

2 0 described above. This expression cassette may further be provided with an AAV 

capsid of a selected serotype to permit infection of a cell or stably transfected into a 
desired host cell for packaging of rAAV virions. Such an expression cassette may be 
readily adapted for use in other viral systems, including adenovirus systems and 
lent i virus systems. Methods of producing Ad/AAV vectors are well known to those 
25 of skill in the art. One desirable method is described in PCT/US95/14018. However, 
the present invention is not limited to any particular method. 

Another aspect, of the present invention is the novel AAV-1 P5 
promoter sequences which are located in the region spanning nt 236 - 299 of SEQ ID 
NO: 1. This promoter is useful in a variety of viral vectors for driving expression of a 

3 0 desired transgene. 
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Similarly, one of skill in the art can readily select other fragments of the 
AAV-1 genome of the invention for use in a variety of vector systems. Such vectors 
systems may include, e.g., lenti viruses, retroviruses, poxviruses, vaccinia viruses, and 
adenoviral systems, among others. Selection of these vector systems is not a 
limitation of the present invention. 

C. Host Cells And Packaging Cell Lines 

In yet another aspect, the present invention provides host cells which 
may be transiently transfected with AAV-1 nucleic acid sequences of the invention to 
permit expression of a desired transgene or production of a rAAV particle. For 
example, a selected host cell may be transfected with the AAV-1 P5 promoter 
sequences and/or the AAV-1 5* ITR sequences using conventional techniques. 
Providing AAV helper functions to the transfected cell lines of the invention results in 
packaging of the rAAV as infectious rAAV particles. Such cell lines may be produced 
in accordance with known techniques [see, e.g, US Patent No. 5,658,785], making 
use of the AAV-1 sequences of the invention. 

Alternatively, host cells of the invention may be stably transfected with 
a rAAV expression cassette of the invention, and with copies of AAV-1 rep and cap 
genes. Suitable parental cell lines include mammalian cell lines and it may be desirable 
to select host cells from among non-simian mammalian cells. Examples of suitable 
parental cell lines include, without limitation, HeLa [ATCC CCL 2], A549 [ATCC 
Accession No. CCL 185], KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI- 
38 [CCL 75] cells. These cell lines are all available from the American Type Culture 
Collection, 10801 University Boulevard, Manassas, Virginia 201 10-2209 USA. Other 
suitable parent cell lines may be obtained from other sources and may be used to 
construct stable cell lines containing the P5 and/or AAV rep and cap sequences of the 
invention. 

Recombinant vectors generated as described above are useful for 
delivery of the DNA of interest to cells. 



WO 00/28061 



PCT/US99/25694 



15 

III. METHODS OF DELIVERING GENES VIA AAV-1 DERIVED VECTORS 
In another aspect, the present invention provides a method for delivery of a 
transgene to a host which involves transfecting or infecting a selected host cell with a 
recombinant viral vector generated with the AAV-1 sequences (or functional 
fragments thereof) of the invention. Methods for delivery are well known to those of 
skill in the art and are not a limitation of the present invention. 

In one desirable embodiment, the invention provides a method for AAV— 
mediated delivery of a transgene to a host. This method involves transfecting or 
infecting a selected host cell with a recombinant viral vector containing a selected 
transgene under the control of sequences which direct expression thereof and AAV-1 
capsid proteins. 

Optionally, a sample from the host may be first assayed for the presence of 
antibodies to a selected AAV serotype. A variety of assay formats for detecting 
neutralizing antibodies are well known to those of skill in the art. The selection of 
such an assay is not a limitation of the present invention. See, e.g., Fisher et al, 
Nature Med. , 3(3):306-312 (March 1997) and W. C. Manning et al, Human Gene 
Therapy . 9:477-485 (March 1, 1998). The results of this assay may be used to 
determine which AAV vector containing capsid proteins of a particular serotype are 
preferred for delivery, e.g., by the absence of neutralizing antibodies specific for that 
capsid serotype. 

In one aspect of this method, the delivery of vector with AAV-1 capsid 
proteins may precede or follow delivery of a gene via a vector with a different 
serotype AAV capsid protein. Thus, gene delivery via rAAV vectors may be used for 
repeat gene delivery to a selected host cell. Desirably, subsequently administered 
rAAV vectors carry the same transgene as the first rAAV vector, but the subsequently 
administered vectors contain capsid proteins of serotypes which differ from the first 
vector. For example, if a first vector has AAV-2 capsid proteins, subsequently 
administered vectors may have capsid proteins selected from among the other 
serotypes, including AAV-1, AAV-3A, AAV-3B, AAV-4 and AAV-6. 
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Thus, a rAAV-1 -derived recombinant viral vector of the invention provides an 
efficient gene transfer vehicle which can deliver a selected transgene to a selected host 
cell in vivo or ex vivo even where the organism has neutralizing antibodies to one or 
more AAV serotypes. These compositions are particularly well suited to gene 
5 delivery for therapeutic purposes. However, the compositions of the invention may 
also be useful in immunization. Further, the compositions of the invention may also 
be used for production of a desired gene product in vitro. 

The above-described recombinant vectors may be delivered to host cells 
according to published methods. An AAV viral vector bearing the selected transgene 

10 may be administered to a patient, preferably suspended in a biologically compatible 
solution or pharmaceutical^ acceptable delivery vehicle. A suitable vehicle includes 
sterile saline. Other aqueous and non-aqueous isotonic sterile injection solutions and 
aqueous and non-aqueous sterile suspensions known to be pharmaceutically 
acceptable carriers and well known to those of skill in the art may be employed for 

15 this purpose. 

The viral vectors are administered in sufficient amounts to transfect the cells 
and to provide sufficient levels of gene transfer and expression to provide a 
therapeutic benefit without undue adverse effects, or with medically acceptable 
physiological effects, which can be determined by those skilled in the medical arts. 

20 Conventional and pharmaceutically acceptable routes of administration include, but 
are not limited to, direct delivery to the liver, oral, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal, and other parental routes of administration. 
Routes of administration may be combined, if desired. 

Dosages of the viral vector will depend primarily on factors such as the 

25 condition being treated, the age, weight and health of the patient, and may thus vary 
among patients. For example, a therapeutically effective human dosage^of the viral 
vector is generally in the range of from about 1 ml to about 100 ml of solution 
containing concentrations of from about 1 x 10 9 to 1 x 10 16 genomes virus vector. A 
preferred human dosage may be about 1 x 10 13 to 1 x 10 16 AAV genomes. The 

30 dosage will be adjusted to balance the therapeutic benefit against any side effects and 
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such dosages may vary depending upon the therapeutic application for which the 
recombinant vector is employed. The levels of expression of the transgene can be 
monitored to determine the frequency of dosage resulting in viral vectors, preferably 
AAV vectors containing the minigene. Optionally, dosage regimens similar to those 
described for therapeutic purposes may be utilized for immunization using the 
compositions of the invention. For //; vitro production, a desired protein may be 
obtained from a desired culture following transfection of host cells with a rAAV 
containing the gene encoding the desired protein and culturing the cell culture under 
conditions which permits expression. The expressed protein may then be purified and 
isolated, as desired. Suitable techniques for transfection, cell culturing, purification, 
and isolation are known to those of skill in the art. 

The following examples illustrate several aspects and embodiments of the 
invention. 

Example 1 - Generation of Infectious Clone of AAV-1 

The replicated form DNA of AAV-1 was extracted from 293 cells that were 
infected by AAV-1 and wild type adenovirus type 5. 

A. Cell Culture and Virus 

AAV-free 293 cells and 84-3 1 cells were provided by the human 
application laboratory of the University of Pennsylvania. These cells were cultured in 
Dulbecco's Modified Eagle Medium with 10% fetal bovine serum (Hyclone), penicillin 
(100 U/ml) and streptomycin at 37°C in a moisturized environment supplied with 5% 
CO,. The 84-31 cell line constitutively expresses adenovirus genes El a, Elb, 
E4/ORF6, and has been described previously [K. J. Fisher, J. Virol., 70:520-532 
(1996)]. AAV-1 (ATCC VR-645) seed stock was purchased from American Type 
Culture Collection (ATCC, Manassas, VA). AAV viruses were propagated in 293 
cells with wild type Ad5 as a helper virus. 

B. Recombinant AAV Generation 

The recombinant AAV viruses were generated by transfection using an 
adenovirus free method. Briefly, the cis plasmid (with AAV ITR), trans plasmid (with 
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AAV rep gene and cap gene) and helper plasmid (pFa13, with essential regions from 
the adenovirus genome) were simultaneously co-transfected into 293 cells in a ratio of 
1 :1 :2 by calcium phosphate precipitation. The pFa!3 helper plasmid has an 8 kb 
deletion in the adenovirus E2B region and has deletions in most of the late genes. 
5 This helper plasmid was generated by deleting the RsrII fragment from pFGl40 
(Microbix, Canada). Typically, 50 \xg of DNA (cis:trans:PFAl3 at ratios of 1:1:2, 
respectively) was transfected onto a 15 cm tissue culture dish. The cells were 
harvested 96 hours post-transfection, sonicated and treated with 0.5% sodium 
deoxycholate (37°C for 10 min). Cell lysates were then subjected to two rounds of a 
10 CsCl gradient. Peak fractions containing AAV vector were collected, pooled, and 

dialyzed against PBS before injecting into animals. To make rAAV virus with AAV-1 
virion, the pAVlH or p5E18 (2/1) was used as the tram plasmid to provide rep and 
cap function. 

For the generation of rAAV based on AAV -2, p5E18 was used as the 
15 trans plasmid since it greatly improved the rAAV yield. This plasmid, p5El 8(2/2), 
expresses AAV-2 Rep and Cap and contains a P5 promoter relocated to a position 3* 
to the Cap gene, thereby minimizing expression of Rep78 and Rep68. The strategy 
was initially described by Li et al, J. ViroL 21:5236-5243 (1997). P5E18(2/2) was 
constructed in the following way. The previously described pMMTV-trans vector 
2 0 (i.e., the mouse mammary tumor virus promoter substituted for the P5 promoter in an 
AAV-2-based vector) was digested with Smal and Clal, filled in with the Klenow 
enzyme, and then recircularized with DNA ligase. The resulting construct was 
digested with Xba\ filled in, and ligated to the blunt-ended BamHI-A7?aI fragment 
from pCR-p5, constructed in the following way. The P5 promoter of AAV was 

2 5 amplified by PCR and the amplified fragment was subsequently cloned into pCR2. 1 

(Invitrogen) to yield pCR-P5. The helper plasmid pAVlH was constructed by cloning 
the B/a\ fragment of pAAV-2 into pBluescript II-SK(+) at the BcorV and Smal sites. 
The 3.0-kb Xbal-Kpnl fragment from p5El 8(2/2), the 2.3-kb Xbal-Kpnl fragment 
from pAVlH, and the 1.7-kb Kpnl fragment from p5E18(2/2) were incorporated into 

3 0 a separate plasmid P5E 18(2/1), which contains AAV-2 Rep, AAV-1 Cap, and the 
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AAV-2 P5 promoter located 3' to the Cap gene. Plasmid p5E 18(2/1) produced 10- to 
20-fold higher quantities of the vector than pAVlH (i.e., 10 12 genomes/50 15-cm 2 
plates). 

C. DNA Techniques 
5 Hirt DNA extraction was performed as described in the art with minor 

modification [R.J. Samulski et al., Cell, 33:135-143 (1983)]. More particularly, Hirst 
solution without SDS was used instead of using original Hirt solution containing SDS. 
The amount of SDS present in the original Hirst solution was added after the cells had 
been fully suspended. To construct AAV-1 infectious clone, the Hirt DNA from 

10 AAV-1 infected 293 cells was repaired with Klenow enzyme (New England Biolabs) 
to ensure the ends were blunt. The treated AAV-1 Hirt DNA was then digested with 
BamHl and cloned into three vectors, respectively. The internal BamHl was cloned 
into pBlueScript I1-SK+ cut with BamHl to get pAVl-BM The left and right 
fragments were cloned into pBlueScript II-SK+ cut with BamHl + EcoRV to obtain 

15 pAVl-BL and pAVl-BR, respectively. The AAV sequence in these three plasmids 
were subsequently assembled into the same vector to get AAV-1 infectious clone 
pAAV-1. The helper plasmid for recombinant AAV-1 virus generation was 
constructed by cloning the Bfa I fragment of p AAV-1 into pBlueScript II-SK+ at the 
EcoRV site. 

20 Analysis of the Hirt DNA revealed three bands, a dimer at 9.4 kb, a 

monomer at 4.7 kb and single-stranded DNA at 1.7 kb, which correlated to different 
replication forms of AAV-1. The monomer band was excised from the gel and then 
digested with BamHl. This resulted in three fragments of 1 . 1 kb, 0.8 kb and 2.8 kb. 
This pattern is in accordance with the description by Bantel-schaal and zur Hausen, 

25 Virol. . 134(1 ):52-63 (1984). The 1.1 kb and 2.8 khBamWl fragments were cloned 
into pBlueScript-KS(+) at BamHl and EcoRV site. The internal 0.8 kb fragment was 
cloned into BamHl site of pBlueScript-KS(+). 

These three fragments were then subcloned into the same construct to 
obtain a plasmid (pAAV-1) that contained the full sequence of AAV-1 . The p AAV-1 

30 was then tested for its ability to rescue from the plasmid backbone and package 
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infectious virus. The p AAV-1 was then transfected to 293 cells and supplied with 
adenovirus type as helper at MOI 10. The virus supernatant was used to reinfect 293 
cells. 

For Southern blot analysis, Hirt DNA was digested with Dpnl to 
remove bacteria-borne plasmid and probed with internal BamHl fragment of AAV-1. 
The membrane was then washed at high stringency conditions, which included: twice 
30 minutes with 2X SSC, 0.1% SDS at 65°C and twice 30 minutes with 0.1X SSC, 

0. 1% SDS at 65 °C. The membrane was then analyzed by both phosphor image and 
X-ray autoradiography. The results confirmed that p AAV-1 is indeed an infectious 
clone of AAV serotype 1 . 

Example 2 - Sequencing Analysis of AAV-1 

The entire AAV-1 genome was then determined by automatic sequencing and 
was found to be 4718 nucleotides in length (Figs. 1 A-1C). For sequencing, an ABI 
373 automatic sequencer as used to determine the sequences for all plasmids and PCR 
fragments related to this study using the FS dye chemistry. All sequences were 
confirmed by sequencing both plus and minus strands. These sequences were also 
confirmed by sequencing two independent clones of pAV-BM, pAV-BL and pAV- 
BR. Since the replicated form of AAV-1 DNA served as the template for sequence 
determination, these sequences were also confirmed by sequencing a series of PCR 
products using original AAV-1 seed stock as a template. 

The length of AAV-1 was found to be within the range of the other serotypes: 
AAV-3 (4726 nucleotides), AAV-4 (4774 nucleotides), AAV-2 (4681 nucleotides), 
and AAV-6 (4683 nucleotides). 

The AAV-1 genome exhibited similarities to other serotypes of adeno- 
associated viruses. Overall, it shares more than 80% identity with otherknown AAV 
viruses as determined by the computer program Megalign using default settings 
[DNASTAR, Madison, WI]. The key features in AAV-2 can also be found in AAV- 

1. First, AAV-1 has the same type of inverted terminal repeat which is capable of 
forming T-shaped hairpin structures, despite the differences at the nucleotide level 
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(Figs. 2 and 3). The sequences of right ITRs and left ITRs of AAV-1 are identical. 
The AAV TR sequence is subdivided into A, A\ B, B\ C, C\ D and D' [Bern, cited 
above]. 

These AAV ITR sequences are also virtually the same as those found in AAV- 
6 right ITR, there being one nucleotide difference in each of A and A' sequence, and 
the last nucleotide of the D sequence. Second, the AAV-2 rep binding motif 
[GCTCGCTCGCTCGCTG (SEQ ID NO: 20)] is well conserved. Such motif can 
also be found in the human chromosome 19 AAV-2 pre-integration region. Finally, 
non-structural and structural coding regions, and regulatory elements similar to those 
of other AAV serotypes also exist in AAV-1 genome. 

Although the overall features of AAV terminal repeats are very much 
conserved, the total length of the AAV terminal repeat exhibits divergence. The 
terminal repeat of AAV-1 consists of 143 nucleotides while those of AAV-2, AAV-3, 
and AAV-4 are about 145 or 146 nucleotides. The loop region of AAV-1 ITR most 
closely resembles that of AAV-4 in that it also uses TCT instead of the TTT found in 
AAV-2 and AAV-3. The possibility of sequencing error was eliminated using 
restriction enzyme digestion, since these three nucleotides are part of the Sad site 
(gagctc; nt 69-74 of SEQ ID NO: 1). The p5 promoter region of AAV-1 shows more 
variations in nucleotide sequences with other AAV serotypes. However, it still 
maintains the critical regulatory elements. The two copies of YY1 [See, Fig. 1A-1C] 
sites seemed to be preserved in all known AAV serotypes, which have been shown to 
be involved in regulating AAV gene expression. In AAV-4, there are 56 additional 
nucleotides inserted between YY1 and E-box/USF site, while in AAV-1, there are 26 
additional nucleotides inserted before the E-box/USF site. The pi 9 promoter, p40 
promoter and polyA can also be identified from the AAV-1 genome by analogy to 
known AAV serotypes, which are also highly conserved. 

Thus, the analysis of AAV terminal repeats of various serotypes showed that 
the A and A' sequence is very much conserved. One of the reasons may be the Rep 
binding motif (GCTC).,GCTG [SEQ ID NO: 20]. These sequences appear to be 
essential for AAV DNA replication and site-specific integration. The same sequence 
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has also been shown to be preserved in a monkey genome [Samulski, personal 
communication]. The first 8 nucleotides of the D sequence are also identical in all 
known AAV serotypes. This is in accordance with the observation of the Srivastava 
group that only the first 10 nucleotides are essential for AAV packaging [X.S. Wang 
5 et al, J. Virol. , 71:3077-3082 (1997); X.S. Wang et al, J. Virol. . 71:1 140-1 146 

(1997)]. The function of the rest of the D sequences still remain unclear. They may 
be somehow related to their tissue specificities. The variation of nucleotide in B and 
C sequence may also suggest that the secondary structure of the ITRs is more critical 
for its biological function, which has been demonstrated in many previous 
10 publications. 

Example 3 - Comparison of AAV-1 Sequences 

The nucleotide sequences of AAV-1, obtained as described above, were 
compared with known AAV sequences, including AAV-2, AAV-4 and AAV-6 using 
DNA Star Megalign. This comparison revealed a stretch of 71 identical nucleotides 

15 shared by AAV-1, AAV-2 and AAV-6. See, Figs. 1A-1C 

This comparison further suggested that AAV-6 is a hybrid formed by 
homologous recombination of AAV-1 and AAV-2. See, Figs. 3Aand3B. These 
nucleotides divide the AAV-6 genome into two regions. The 5* half of AAV-6 of 522 
nucleotides is identical to that of AAV-2 except in 2 positions. The 3 1 half of AAV-6 

20 including the majority of the rep gene, complete cap gene and 3' ITR is 98% identical 
to AAV-1. 

Biologically, such recombination may enable AAV-1 to acquire the ability to 
transmit through the human population. It is also interesting to note that the ITRs of 
AAV-6 comprise one AAV-1 ITR and one AAV-2 ITR. The replication model of 
25 defective parvovirus can maintain this special arrangement. Studies on AAV 

integration have shown that a majority of AAV integrants carries deletions in at least 
one of the terminal repeats. These deletions have been shown to be able to be 
repaired through gene conversion using the other intact terminal repeat as a template. 
Therefore, it would be very difficult to maintain AAV-6 as a homogenous population 
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when an integrated copy of AAV-6 is rescued from host cells with helper virus 
infection. The AAV-6 with two identical AAV-2 ITRs or two identical AAV-1 ITRs 
should be the dominant variants. The AAV-6 with two AAV-1 ITRs has been 
observed by Russell's group [Rutledge, cited above (1998)]. So far there is no report 
on AAV-6 with two AAV-2 ITRs. Acquirement of AAV-2 P5 promoter by AAV-6 
may have explained that AAV-6 have been isolated from human origin while AAV-1 
with the same virion has not. The regulation of P5 promoter between different 
species of AAV may be different in vivo. This observation suggests the capsid 
proteins of AAV were not the only determinants for tissue specificity. 

Although it is clear that AAV-6 is a hybrid of AAV-1 and AAV-2, AAV-6 has 
already exhibited divergence from either AAV-1 or AAV-2. There are two nucleotide 
differences between AAV-6 and AAV-2 in their first 450 nucleotides. There are 
about 1% differences between AAV-6 and AAV-1 in nucleotide levels from 
nucleotides 522 to the 3' end. There also exists a quite divergent region (nucleotide 
4486-4593) between AAV-6 and AAV-1 (Figs. 1A-1C). This region does not 
encode any known proteins for AAVs. These differences in nucleotide sequences may 
suggest that AAV-6 and AAV-1 have gone through some evolution since the 
recombination took place. Another possible explanation is that there exists another 
variant of AAV-1 which has yet to be identified. So far, there is no evidence to rule 
out either possibility. It is still unknown if other hybrids (AAV-2 to AAV-4, etc.) 
existed in nature. 

The coding region of AAV-1 was deduced by comparison with other known 
AAV serotypes. Table 1 illustrates the coding region differences between AAV-1 and 
AAV-6. The amino acid residues are deduced according to AAV-2. 

With reference to the amino acid position of AAV-1, Table 1 lists the amino 
acids of AAV-1 which have been changed to the corresponding ones of AAV-6. The 
amino acids of AAV-1 are shown to the left of the arrow. Reference may be made to 
SEQ ID NO: 5 of the amino acid sequence of AAV-1 Rep 78 and to SEQ ID NO: 13 
for the amino acid sequence of AAV-1 VP1. 
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Table 1 



Coding region variations between AAV-1 and AAV-6 



Rep protein (Rep78) 




Cap protein (VP1) 


Position(s) 


Amino acids 




Position(s) 


Amino acids 


28 


S-N 




129 


L-F 


191 


ChH 




418 


E-D 


192 


H-D 




531 


E-K 


308 


E-D 




584 


F-L 








598 


A-V 








642 


N-H 



It was surprising to see that the sequence of the AAV-1 coding region is 
almost identical to that of AAV-6 from position 452 to the end of coding region 
(99%). The first 508 nucleotides of AAV-6 have been shown to be identical to those 
of AAV-2 [Rutledge, cited above (1998)]. Since the components of AAV-6 genome 
seemed to be AAV-2 left ITR - AAV-2 p5 promoter - AAV-1 coding region - AAV- 
1 right ITR , it was concluded that AAV-6 is a naturally occurred hybrid between 
AAV-1 and AAV-2. 

Example 4 - Gene Therapy Vector Based on AAV-1 

Recombinant gene transfer vectors based on AAV-1 viruses were constructed 
by the methods described in Example 1. To produce a hybrid recombinant virus with 
AAV-1 virion and AAV-2 ITR, the AAV-1 trans plasmid (pAVlH) and the AAV-2 
cis-lacZ plasmid (with AAV-2 ITR) were used. The AAV-2 ITR was used in this 
vector in view of its known ability to direct site-specific integration. Also constructed 
for use in this experiment was an AAV-1 vector carrying the green fluorescent protein 
(GFP) marker gene under the control of the immediate early promoter of CMV using 
pAVlH as the trans plasmid. 
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A. rAAV-1 Viruses Transfect Host Cells in Vitro 

84-31 cells, which are subclones of 293 cells (which express 
adenovirus El a, Elb) which stably express E4/ORF5, were infected with rAAV-1 
GFP or rAAV-lacZ. High levels of expression of GFP and lacZ was detected in the 
5 cultured 84-3 1 cells. This suggested that rAAV-1 based vector was very similar to 
AAV-2 based vectors in ability to infect and expression levels. 

B. rAAV-1 Viruses Transfect Cells in Vivo 

The performance of AAV-1 based vectors was also tested in vivo. The 
rAAV-1 CMV-al AT virus was constructed as follows. The EcoRJ fragment of 

10 pAT85 (ATCC) containing human al -antitrypsin (al AT) cDNA fragment was 

blunted and cloned into PCR (Promega) at a Smal site to obtain PCR-al AT. The 
CMV promoter was cloned into PCR-al AT at the Xbal site. The Alb-al AT 
expression cassette was removed by Xhol and Clal and cloned into pAVlH at the 
Xbal site. This vector plasmid was used to generate AAV- 1 -CMV-al AT virus used 

15 in the experiment described below. 

For screening human antibodies against AAV, purified AAV virus is 
lysed with Ripa buffer (10 mM Tris pH 8.2, 1% Triton X-100, 1% SDS, 0.15 M 
NaCl) and separated in 10% SDS-PAGE gel. The heat inactivated human serum was 
used at a 1 to 1000 dilution in this assay. The rAAVrl CMV-al AT viruses were 

20 injected into Rag-1 mice through tail vein injection at different dosages. The 

concentration of human al -antitrypsin in mouse serum was measured using ELISA. 
The coating antibody is rabbit anti-human human al -antitrypsin (Sigma). The goat- 
antihuman al -antitrypsin (Sigma) was used as the primary detection antibodies. The 
sensitivity of this assay is around 0.3 ng/ml to 30 ng/ml. The expression of human a- 

25 antitrypsin in mouse blood can be detected in a very encouraging level. This result is 
shown in Table 2. 
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Table 2 



Human Antitrypsin Expressed in Mouse Liver 



Amount of virus injected 


Week 2 (ng/ml) 


Week 4 (ng/ml) 


2x1 0 10 genomes 


214.2 


171.4 


lxl 0'° genomes 


117.8 


109.8 


5x10'° genomes 


64.5 


67.8 


2.5x10'° genomes 


30.9 


58.4 



rAAV-1 CMV-lacZ viruses were also injected into the muscle of 
C57BL6 mice and similar results were obtained. Collectively, these results suggested 
10 that AAV-1 based vector would be appropriate for both liver and muscle gene 
delivery. 

Example 5 - Neutralizing Antibodies Against AAV- 1 

Simple and quantitative assays for neutralizing antibodies (NAB) to AAV-1 
and AAV-2 were developed with recombinant vectors. A total of 33 rhesus monkeys 

15 and 77 normal human subjects were screened. 
A. Nonhitman Primates 

Wild-caught juvenile rhesus monkeys were purchased from Covance 
(Alice, Tex.) and LABS of Virginia (Yemassee, SC) and kept in full quarantine. The 
monkeys weighed approximately 3 to 4 kg. The nonhuman primates used in the 

20 Institute for Human Gene Therapy research program are purposefully bred in the 
United States from specific-pathogen-free closed colonies. All vendors are US 
Department of Agriculture class A dealers. The rhesus macaques are therefore not 
infected with important simian pathogens, including the tuberculosis agent, major 
simian lentiviruses (simian immunodeficiency virus and simian retroviruses), and 

25 cercopithecine herpesvirus. The animals are also free of internal and external 
parasites. The excellent health status of these premium animals minimized the 
potential for extraneous variables. For this study, serum was obtained from monkeys 
prior to initiation of any protocol. 
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NAB titers were analyzed by assessing the ability of serum antibody to 
inhibit the transduction of reporter virus expressing green fluorescent protein (GFP) 
(AAV1-GFP or AAV2-GFP) into 84-31 cells. Various dilutions of antibodies 
preincubated with reporter virus for 1 hour at 37°C were added to 90% confluent cell 
5 cultures. Cells were incubated for 48 hours and the expression of green fluorescent 
protein was measured by Fluorolmaging (Molecular Dynamics). NAB titers were 
calculated as the highest dilution at which 50% of the cells stained green. 

Analysis of NAB in rhesus monkeys showed that 61% of animals 
tested positive for AAV-1; a minority (24%) has NAB to AAV-2. Over one-third of 
10 animals had antibodies to AAV-1 but not AAV-2 (i.e., were monospecific for AAV- 
1), whereas no animals were positive for AAV-2 without reacting to AAV-1 . These 
data support the hypothesis that AAV-1 is endemic in rhesus monkeys. The presence 
of true AAV-2 infections in this group of nonhuman primates is less clear, since cross- 
neutralizing activity of an AAV-1 response to AAV-2 can not be ruled out. It is 
15 interesting that there is a linear relationship between AAV-2 NAB and AAV-1 NAB 
in animals that had both. 
B. Humans 

For these neutralization antibody assays, human serum samples were 
incubated at 56°C for 30 min to inactivate complement and then diluted in DMEM. 

20 The virus (rAAV or rAd with either lacZ or GFP) was then mixed with each serum 
dilution (20X, 400X, 2000X, 4000X, etc.) and incubated for 1 hour at 37°C before 
applied to 90% confluent cultures of 84-31 cells (for AAV) or Hela cells (for 
adenovirus) in 96-well plates. After 60 minutes of incubation at culture condition, 
100 \i\ additional media containing 20% FCS was added to make final culture media 

2 5 containing 1 0% FCS. 
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The result is summarized in Table 3. 



Table 3 



Adenovirus 


AAV-1 


A A \ 7 O 

AAV-2 


# of samples 


Percentage 








H l 


JJ.Z/0 


+ 






16 


20.8% 




+ 




0 


0.0% 






+ 


2 


2.6% 




+ 


+ 


2 


2.6% 


+ 




+ 


3 


3.9% 


+ 


• + 




0 


0.0% 


+ 


+ 


+ 


13 


16.9% 




Total 


77 


100% 



The human neutralizing antibodies against these three viruses seemed to be 
unrelated since the existence of neutralizing antibodies against AAV are not 
indications for antibodies against adenovirus. However, AAV requires adenovirus as 

15 helper virus, in most of the cases, the neutralizing antibodies against AAV correlated 
with the existence of neutralizing antibodies to adenovirus. Among the 77 human 
serum samples screened, 41% of the samples can neutralize the infectivity of 
recombinant adenovirus based on Ad5. 15/77 (19%) of serum samples can neutralize 
the transduction of r AAV-1 while 20/77 (20%) of the samples inhibit rAAV-2 

20 transduction at 1 to 80 dilutions or higher. All serum samples positive in neutralizing 
antibodies for AAV-1 in are also positive for AAV-2. However, there are five (6%) 
rAAV-2 positive samples that failed to neutralize r AAV-1 . In samples that are 
positive for neutralizing antibodies, the titer of antibodies also varied in the positive 
ones. The results from screening human sera for antibodies against AAVs supported 

25 the conclusion that AAV-1 presents the same epitome as that of AAV-2 to interact 
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with cellular receptors since AAV-1 neutralizing human serums can also decrease the 
infectivity of AAV-2. However, the profile of neutralizing antibodies for these AAVs 
is not identical, there are additional specific receptors for each AAV serotype. 

Example 6 - Recombinant AAV Viruses Exhibit Tissue Tropism 
5 The recombinant AAV-1 vectors of the invention and the recombinant AAV-2 

vectors [containing the gene encoding human a 1 -antitrypsin (al AT) or murine 
erythropoietin (Epo) from a cytomegalovirus-enhanced p-actin promoter (CB)] were 
evaluated in a direct comparison to equivalent copies of AAV-2 vectors containing 
the same vector genes. 

10 Recombinant viruses with AAV-1 capsids were constructed using the 

techniques in Example 1. To make rAAV with AAV-1 virions, pAVlH or p5E18 
(2/1) was used as the trans plasmid to provide Rep and Cap functions. For the 
generation of the rAAV based on AAV-2, p5E 18(2/2) was used as the trans plasmid, 
since it greatly improved the rAAV yield. [Early experiments indicated similar //; vivo 

15 performances of AAV-1 vectors produced with pAVlH and p5E19 (2/1). All 

subsequent studies used AAV-1 vectors derived from p5E 18(2/1) because of the 
increased yield.] 

Equivalent stocks of the AAV-1 and AAV-2 vectors were injected 

intramuscularly (5 x 10 10 genomes) or liver via the portal circulation (1 x 10 n 
20 genomes) into immunodeficient mice, and the animals (four groups) were analyzed on 

day 30 for expression of transgene. See, Figs. 4 A and 4B. 

AAV-2 vectors consistently produced 10- to 50-fold more serum 

erythropoietin or a 1 -antitrypsin when injected into liver compared to muscle. 

(However, the AAV-1 -delivered genes did achieve acceptable expression levels in the 
25 liver.) This result was very different from that for AAV-1 vectors, with which muscle 

expression was equivalent to or greater than liver expression. In fact, AAV-1 

outperformed AAV-2 in muscle when equivalent titers based on genomes were 

administered. 



WO 00/28061 



PCT/US99/25694 



Example 7 - Gene Delivery via r AAV-1 

C57BL/6 mice (6- to 8-week old males, Jackson Laboratories) were analyzed 
for AAV mediated gene transfer to liver following intrasplenic injection of vector (i.e., 
targeted to liver). A total of 10 n genome equivalents of rAAV-1 or rAAV-2 vector 
5 were injected into the circulation in 100 ^1 buffered saline. The first vector contained 
either an AAV-1 capsid or an AAV-2 capsid and expressed al AT under the control of 
the chicken P-actin (CB) promoter. Day 28 sera were analyzed for antibodies against 
AAV-1 or AAV-2 and serum al AT levels were checked. Animals were then injected 
with an AAV-1 or AAV-2 construct expressing erythropoietin (Epo, also under the 

10 control of the CB promoter). One month later sera was analyzed for serum levels of 
Epo. The following groups were analyzed (Figs. 5 A-5D). 

In Group 1, vector 1 was AAV-2 expressing al AT and vector 2 was AAV-2 
expressing Epo. Animals generated antibodies against AAV-2 following the first 
vector administration which prevented the readministration of the AAV-2 based 

15 vector. There was no evidence for cross-neutralizing the antibody to AAV-1 . 

In Group 2, vector 1 was AAV-1 expressing al AT while vector 2 was AAV-1 
expressing Epo. The first vector administration did result in significant al AT 
expression at one month associated with antibodies to neutralizing antibodies to 
AAV-1. The animals were not successfully readministered with the AAV-1 Epo 

2 0 expressing construct. 

In Grpup 3, the effectiveness of an AAV-2 vector expressing Epo injected into 
a naive animal was measured. The animals were injected with PBS and injected with 
AAV-2 Epo vector at day 28 and analyzed for Epo expression one month later. The 
neutralizing antibodies were evaluated at day 28 so we did not expect to see anything 

2 5 since they received PBS with the first vector injection. This shows that in naive 

animals AAV-2 is very efficient at transferring the Epo gene as demonstrated by high 
level of serum Epo one month later. 

Group 4 was an experiment similar to Group 3 in which the animals originally 
received PBS for vector 1 and then the AAV-1 expressing Epo construct 28 days 

30 later. At the time of vector injection, there obviously were no antibodies to either 
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AAV-1 or AAV-2. The AAV-1 based vector was capable of generating significant 
expression of Epo when measured one month later. 

Group 5 is a cross-over experiment where the initial vector is AAV-2 
expressing al AT followed by the AAV-1 construct expressing Epo. The animals, as 
5 expected, were efficiently infected with the AAV-2 vector expressing a 1 AT as shown 
by high levels of the protein in blood at 28 days. This was associated with significant 
neutralizing antibodies to AAV-2. Importantly, the animals were successfully 
administered AAV-1 following the AAV-2 vector as shown by the presence of Epo in 
serum 28 days following the second vector administration. At the time of this vector 

10 administration, there was high level AAV-2 neutralizing antibodies and very low 

cross-reaction to AAV-1 . The level of Epo was slightly diminished possibly due to a 
small amount of cross-reactivity. Group 6 was the opposite cross-over experiment in 
which the initial vector was AAV-1 based, whereas the second experiment was AAV- 
2 based. The AAV-1 vector did lead to significant gene expression of al AT, which 

15 also resulted in high level AAV-1 neutralizing antibody. The animals were very 

efficiently administered AAV-2 following the initial AAV-1 vector as evidenced by 
high level Epo. 

A substantially identical experiment was performed in muscle in which 5 x 10 10 
genomes were injected into the tibialis anterior of C57BL/6 mice as a model for 
20 muscle directed gene therapy. The results are illustrated in Figs. 6A-6D and are 
essentially the same as for liver. 

In summary, this experiment demonstrates the utility of using an AAV-1 
vector in patients who have pre-existing antibodies to AAV-2 or who had initially 
received an AAV-2 vector and need readministration. 

25 Example 8 - Construction of Recombinant Viruses Contain ing AAV-1 ITRs 

This example illustrates the construction of recombinant AAV vectors which 

contain AAV-1 ITRs of the invention. 

An AAV-1 cis plasmid is constructed as follows. A 160 bp Xho-Nrul AAV-1 

fragment containing the AAV-1 5' ITR is obtained from pAVl-BL. pAVl-BL was 
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generated as described in Example 1. The Xho-Nrul fragment is then cloned into a 
second pAVl-BL plasmid at an Xbal site to provide the plasmid with two AAV-1 
ITRs. The desired transgene is then cloned into the modified pAV-lBL at the Nrul 
and BamHI site, which is located between the AAV-1 ITR sequences. The resulting 
5 AAV-1 cis plasmid contains AAV-1 ITRs flanking the transgene and lacks functional 
AAV-1 rep and cap. 

Recombinant AAV is produced by simultaneously transfecting three plasmids 
into 293 cells. These include the AAV-1 cis plasmid described above; a trans plasmid 
which provides AAV rep/cap functions and lacks AAV ITRs; and a plasmid providing 

10 adenovirus helper functions. The rep and/or cap functions may be provided in trans 
by AAV-1 or another AAV serotype, depending on the immunity profile of the 
intended recipient. Alternatively, the rep or cap functions may be provided in cis by 
AAV-1 or another serotype, again depending on the patient's immunity profile. 

In a typical cotransfection, 50 \xg of DNA (cis:trans:helper at ratios of 1:1:2, 

15 respectively) is transfected onto a 15 cm tissue culture dish. Cells are harvested 96 
hours post transfection, sonicated and treated with 0.5% sodium deoxycholate (37° 
for 10 min). Cell lysates are then subjected to 2-3 rounds of ultracehtrifugation in a 
cesium gradient. Peak fractions containing rAAV are collected, pooled and dialyzed 
against PBS. A typical yield is 1 x 10 u genomes/ 10 9 cells. 

20 Using this method, one recombinant virus construct is prepared which contains 

the AAV-1 ITRs flanking the transgene, with an AAV-1 capsid. Another recombinant 
virus construct is prepared with contains the AAV-1 ITRs flanking the transgene, with 
an AAV-2 capsid. 

All publications cited in this specification are incorporated herein by reference. 
25 While the invention has been described with reference to a particularly preferred 

embodiments, it will be appreciated that modifications can be made without departing 
from the spirit of the invention. Such modifications are intended to fall within the 
scope of the claims. 
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What is claimed is: 

1 . An isolated AAV-1 nucleic acid molecule comprising a sequence 
selected from the group consisting of: 

(a) SEQIDNO: 1; 

(b) a DNA sequence complementary to SEQ ID NO: 1 ; 

(c) cDNA complementary to (a) or (b); and 

(d) RNA complementary to any of (a) to (c). 

2. A nucleic acid molecule comprising an AAV-1 inverted terminal repeat 
(ITR) sequence selected from the group consisting of: 

(a) nt 1 to 143 of SEQ ID NO: 1; 

(b) nt 4576 to 47 18 of SEQ ID NO: 1; 

(c) a nucleic acid sequence complementary to (a) or (b); and 

(d) a functional fragment of (a), (b), or (c). 

3. A recombinant vector comprising a 5* AAV-1 inverted terminal repeat 
(ITR) and a selected transgene, wherein said ITR has the sequence selected from the 
group consisting of: 

(a) nt 1 to 143 of SEQ ID NO: 1; 

(b) a nucleic acid sequence complementary to (a); and a 

(c) a functional fragment of (a) or (b). 

4. The recombinant vector according to claim 3, wherein said vector 
further comprises a 3' AAV-1 ITR. 
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5. A recombinant vector comprising a 3' AAV-1 inverted terminal repeat 
(ITR) and a selected transgene, wherein said ITR has the sequence selected from the 
group consisting of: 

(a) nt4576to4718ofSEQIDNO: 1; 

(b) a nucleic acid sequence complementary to (a); and 

(c) a functional fragment of (a) or (b). 

6. The recombinant vector according to claim 5, wherein said vector 
further comprises a 5' AAV-1 ITR. 

7. The recombinant vector according to any of claims 3-6, wherein said 
vector further comprises AAV-1 capsid proteins having the sequence of SEQ ID NO: 
13, 15 or 17 or functional fragments thereof 

8. The recombinant vector according to any of claims 3-6, wherein said 
vector further comprises adenovirus sequences. 

9. A recombinant vector comprising an AAV-1 P5 promoter having the 
sequence of nt 236 to 299 of SEQ ID NO: 1 or a functional fragment thereof. 

10. A nucleic acid molecule encoding AAV-1 helper functions, said 
molecule comprising an AAV rep coding region and an AAV cap coding region, 
wherein said cap coding region comprises at least one member is selected from the 
group consisting of: 

(a) vpl, nt 2223 to 4431 of SEQ ID NO: 1; 

(b) vp2, nt 2634 to 4432 of SEQ ID NO: 1; and- 

(c) vp3, nt 2829 to 4432 of SEQ ID NO: 1 . 
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11. A nucleic acid molecule encoding AAV-1 helper functions, said 
molecule comprising an AAV rep coding region and an AAV cap coding region, 
wherein said rep coding region comprises an AAV-1 rep coding region comprising at 
least one member selected from the group consisting of: 

(a) rep 78, nt 335 to 2304 of SEQ ID NO: 1; 

(b) rep 68, nt 335 to 2272 of SEQ ID NO: 1 or the cDNA 
corresponding thereto; 

(c) rep 52, nt 1007 to 2304 of SEQ ID NO: 1; and 

(d) rep 40, nt 1007 to 2272 of SEQ ID NO: 1 or the cDNA 
corresponding thereto. 

12. A host cell transduced with a recombinant viral vector according to 
any of claims 3-6. 

13. A host cell transduced with a nucleic acid molecule according to any of 
claims 1,2, 10 or 11. 

14. A host cell stably transduced with an AAV-1 P5 promoter having the 
sequence of nt 236 to 299 of SEQ ID NO: 1. 

15. A pharmaceutical composition comprising a carrier and a virus 
comprising the vector according to any of claims 3-6. 

16. A pharmaceutical composition comprising a carrier and a virus 
comprising the vector according to claim 7. 

17. A pharmaceutical composition comprising a carrier and a virus 
comprising the vector according to claim 8. 
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18. A method for AAV-mediated delivery of a transgene comprising the 
step of delivering to a host cell an AAV virion which comprises: 

(a) a capsid comprising at least one capsid protein encoded by an 
AAV-1 cap gene; and 

(b) a DNA molecule comprising a transgene under the control of 
regulatory sequences directing its expression. 

19. A method for AAV-mediated delivery of a transgene to a host 
comprising the steps of: 

(a) assaying a sample from the host to determine the presence of 
neutralizing antibodies specific against any serotype of AAV; and 

(b) delivering to the host an AAV virion which comprises: 

(i) a capsid comprising at least one capsid protein encoded 
by a cap gene of an AAV serotype against which the host has no antibodies as 
determined in step (a); and 

(ii) a DNA molecule comprising a transgene under the 
control of regulatory sequences directing its expression. 

20. The method according to claim 19, comprising the additional step of 
repeating steps (a) and (b). 

21. Use of an AAV virion which comprises a capsid comprising (a) at 
least one capsid protein encoded by a cap gene of an AAV serotype against which the 
host has antibodies, and (b) a DNA molecule comprising a transgene operably linked 
to regulatory sequences directing its expression, 

in the preparation of a medicament for delivery of a transgene to a 
host, wherein said host has no preexisting neutralizing antibodies against the AAV 
serotype of said cap gene. 
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22. A method for delivery of a transgene comprising the step of delivering 
to a host cell a recombinant virus comprising a recombinant vector according to any 
of claims 3-8. 



23. A method for producing a selected gene product comprising the steps 
of transfecting a mammalian cell with the molecule according to claim 1 or a 
functional fragment thereof and culturing said cell under conditions suitable to express 
said gene product. 
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FIG 1A 

AAV-1 ttgcccactccctctctgcgcgctcgctcgctcggtggggcctgcggaccaaaggtccgc 

AAV-2 . . .g • ■ . . .ac . .a . . . .g .gc gc. 

AAV-6 ...g ac.a g.gc gc. 60 

Rep binding site 



AAV-1 agacggcagagctctgctctgccggccc 



AAV-2 c c.c.g...t...c.g.g t..gt, 

AAV-6 c c.c.g. . .t. . .c.g.g q. .gt. 



caccgagcgagcgagcgcgcagagagggagtg 120 



AAV-1 CGTAAATTACGTCATAGGG GAGTGGTCCTGTATTAGCT GTCACGTGjAGTGCTTTTGC 237 

AAV-2 . . .G TTA.G.A AG 

AAV-6 ...G TTA.G.A A G) 



60 
60 



120 
120 



TRS^ 

AAV-1 ggcaactccatcactaggggtaaTCGCGAAGCGCCTCCCACGCTGCCGCGTCAGCGCTGA 180 

AAV-2 . c — ..ct.G..G. .TG.A...G ... 163 

AAV-6 .c — ..ct.G..G. .TG.A...G ... 163 

E box/USF 



222 
222 



YY1 P 5 /TATA 

AAV-1 GACATTTTGCGACACCACGTGGCCATTTAGGGTATATATGGCCGAGTGAGCGAGCAGGAT 297 

^V-2 T....T..CGCT T..A.C AC G. 282 

AAV-6 T....T..CGCT T. .A.C AC. G. 282 

Y Yl/p5 RN A Rep_J78/68 

AAV-1 CTCCATTTTGAC-CGCGAAATTTGAACGAGC AGCAGCCATGCCGGGCTTCTACGAGATCG 356 

AAV-2 AG..G..GG C C G..T T. 342 

AAV-6 AG..G..GG... C - G..T T. 341 



AAV- 1 TGATCAAGGTGCCGAGCGACCTGGACGAGC ACCTGCCGGGCATTTCTGACTCGTTTGTGA 416 

AAV-2 T C..C T G...T C AGC 402 

AAV-6 T C..C T T C AGC 



401 



AAV-1 GCTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCTGAATCTGA 47 6 

AAV-2 A T...-G..A 462 

AAV-6 A T....G..A 461 



AAV- 1 TTGAGCAGGCACCCCTGACCGTGGCCG AGAAGCTGCAGCGCGACTTCCTGGTCCAATGGC 536 

AAV-2 : T...ACGG 522 

AAV-6 G ' * ' 



521 



AAV- 1 GCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGTCCT 596 

AAV-2 T T G..A..T A... AG.. 582 

AAV-6 



581 



AAV-1 ACTTCCACCTCCATATTCTGGTGGAGACCACGGGGGTCAAATCCATGGTGCTGGGCCGCT 65 6 

AAV-2 A.G..CG.G..C A C G ...TT A..T. 642 

AAV-6 



641 



AAV-1 TCCTGAGTCAGATTAGGGACAAGCTGGTGCAGACCATCTACCGCGGGATCGAGCCGACCC 716 

AAV-2 C.C..A..A...A.T GA..T TT 702 

AAV-6 



701 



AAV-1 TGCCCAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGAGGGGGGAACAAGGTGG 77 6 

AAV-2 A C..A CA.A C 762 

AAV-6 



761 
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FIG IB 

AAV- 1 TGGACGAGTGCTACATCCCCAACT ACCTCCTGCCCAAGACTCAGCCCGAGCTGCAGTGGG 836 

AAV-2 :...T T...T.G..C A. .C T.....C 822 

AAV-6 • ' 



821 



P 19/TATA PI 9 RNA^ 

AAV- 1 CGTGGACTAACATGG AGGAGTATATAAGCGCCTGTTTGAACCTGGCCGAGCGCAAACGGC 



896 

AAV-2 T AC T C G..T..CA.G T T 882 



AAV-6 C GG. 



,G A..C..GG.C 881 



AAV- 1 TCGTGGCGCAGCACCTGACCCACGTCAGCCAGACCCAGGAGCAGAACAAGGAGAATCTGA 956 

AAV-2 .G ..T G GTCG G A A.. 942 

AAV-6 CG 



941 



Rep 52 /40 

AAV- 1 ACCCC AATTCTGACGCGCCTGTCATCCGGTC AAAAACCTCCGCGCGCTAC ATGGAGCTGG 1016 
AAV-2 T T G..G...A.A T..A..CA.G 1002 

S-6 : ::::: * 1001 

AAV-l TCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGG 1076 

™« c AA --- G -- T 6 ::::::::::::::::::::::: iSS 

AAV-6 

AAV-l CCTCGTACATCTCCTTCAACGCCGCTTCCAACTCGCGGTCCCAGATCAAGGCCGCTCTGG 1136 
AAV-2 A T..G..C A T..CT... 1122 

rtAV £ ... 1121 

AAV-6 

AAV-l ACAATGCCGGCAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTAGGCCCCG 1196 

AAV-2 G..A T...AGC T...A....C G....AGC 1182 

AAV-6 1181 

AAV-l CTCCGCCCGCGGACATTAAAACCAACCGCATCTACCGCATCCTGGAGCTGAACGGCTACG 1256 

AAV-2 AG..CGTG.A TCC .G. . .T. .G. .T. . TAAA . . TT .... A A G.... 1242 

AAV-6 C T 1241 

AAV-l AACCTGCCT ACGCCGGCTCCGTCTTTCTCGGCTGGGCCC AGAAAAGGTTCGGG AAGCGC A 1316 

AAV-2 .T..CCAA..T..G.CT G. .A AC A 

AAV-6 .C A "'- A 

AAV-l ACACCATCTGGCTGTTTGGGCCGGCCACCACGGGCAAGACCAACATCGCGGAAGCCATCG 1376 

AAV-2 T. .A. .T. .C. .G G A. 1362 

AAV-6 

AAV-l CCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAATG 1436 
^ ht G A 1422 

AAV-2 A-l « 1421 

AAV-6 C * iqZ1 

AAV-l ATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGG 1496 

AAV-2 .C..T G C: 1482 



AAV-6 



1481 



AAV-l AGTCCGCC AAGGCC ATTCTCGGCGGC AGC AAGGTGCGCGTGGACCAAAAGTGC AAGTCGT 1556 

AAV-2 ....G A A.. A G. .A C. 1542 

AAV-6 



1541 



AAV-l CCGCCCAGATCGACCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGA 1616 

AAV-2 .G A.....G..T 

aav-fi T 
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FIG 1C 

AAV-1 TTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCGTTGCAGGACCGGATGTTCAAAT 167 6 

AAV-2 TCA..G A -A.. 1662 

AAV-6 . 1661 

AAV-1 TTGAACTCACCCGCCGTCTGGAGCATGACTTTGGCAAGGTGACAAAGCAGGAAGTCAAAG 1740 

AAV-2 T G C..C 1722 

AAV-6 1721 

AAV-1 AGTTCTTCCGCTGGGCGCAGGATCACGTGACCGAGGTGGCGCATGAGTTCTACGTCAGAA 17 96 

AAV-2 .C..T G AA GTT A A A.. 1782 

AAV-6 1781 

P 4Q/TAT A 

AAV-1 AGGGTGGAGCCAACAAAAGACCCGCCCCCGATGACGCGGATAAAAGCGAGCCCAAGCGGG 18 56 

AAV-2 G AG A T...T A 1842 

AAV-6 G • 1841 

P40 RNA^ 

AAV-1 CCTGCCCCTCAGTCGCGGATCCATCGACGTCAGACGCGGAAGGAGCTCCGGTGGACTTTG 1916 

AAV-2 TGC..GAG T. . .C.G . . T . . A . CA . . . AC . 1899 

AAV-6 1901 

T 

AAV-1 CCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCT 1976 

AAV-2 .A T AA..T 1959 

AAV-6 1961 

AAV- 1 GCAAGACATGCGAGAGAATGAATCAG AATTTC AAC ATTTGCTTC ACGC ACGGG ACGAGAG 2036 

AAV-2 ...GACA CA. .T. .C T ACA. .A. . 2019 

AAV-6 A C 2021 

AAV-1 ACTGTTCAGAGTGCTTCCCCGGCGTGTCAGAATCTCAACCGGTC GTCAGAAAGAGGA 2093 

AAV-2 T T.. C. .TTCT. . .GTC. .A.A.G 2076 

AAV-6 A..T 2078 

AAV- 1 CGTATCGGAAACTCTGTGCCATTCATCATCTGCTGGGGCGGGCTCCCGAGATTGCTTGCT 2153 

AAV-2 A G..CTA A.CA AAA..TG..A.. C. A 2133 

AAV-6 . 2138 

Rep 78 stop 

AAV-1 CGGCCTGCGATCTGGTCAACGTGGACCTGGATGACTGTGTTTCTGAGCAATAAATGACTT 2213 

AAV-2 .T T TT CA.C.T...A T.. 2193 

AAV-6 T 2193 

V VP1 V Rep 68 stop 

AAV- 1 AAACCAGGTATGGCTGCCGATGGTT ATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAG 2273 

AAV-2 T CT A 2253 

AAV-6 --AC G 2258 

AAV-1 GGCATTCGCGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAA 2333 

AAV-2 ..A..AA.AC A.GC.C CC . A . . ACCA . . A . . GC . . GCAG . . . GG 2313 

AAV-6 A 2318 

AAV-1 AAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAAC 2393 

AAV-2 C.TA A.. A T G 2 373 

AAV-6 G..C G C 2 378 
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FIG ID 

AAV - 1 GGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAG 2453 

AAV-2 A G... A. ..A C A 2433 

AAV-6 T ... 2438 

AAV- 1 GCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGAC 2513 

AAV-2 G G.CAGC..A C CAA...C 2493 

AAV-6 A.AGCG..T T GCG...T 2498 

AAV-1 GCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCA 2573 

AAV-2 . .G C..TA A 2553 

AAV-6 ..C T..GC G 2558 

AAV-1 GTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTAAG 2633 

AAV-2 G..A...A T G. .C CCT.T.... 2613 

AAV-6 A T.T T 2618 

VP 2 

AAV- 1 ACGGCTCCTGGAAAG AAACGTCCGGT AGAGC AGTCGCC AC AAGAGCC AGACTCCTCCTCG 2693 

AAV-2 G A..GA.G C..T..TGTG 2673 

AAV-6 T ..... G . . AC . T G. .G. .ACAA 2678 

AAV- 1 GGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAGACTGGC 2753 

AAV-2 . .A.C. . .A. . .G.G T. . A.G. . .A. . .T.G ...A 2733 

AAV-6 T 2738 

AAV- 1 GACTC AG AGTCAGTCCCCGATCC AC AACCTCTCGG AGAACCTCCAGCAACCCCCGCTGCT 2813 

AAV-2 . . . G C A. .T. .C. .C. .G C.G..A G T...G. 2793 

AAV-6 ...T G C. .C. .C. .A. .A G.A. .T A G 2798 

VP3 

AAV- 1 GTGGGACCTACT AC AATGGCTTC AGGCGGTGGCGC ACCAATGGCAGAC AATAACG AAGGC 2873 

AAV-2 C A A G A A G 2853 

AAV-6 2858 

AAV-1 GCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGAC 2933 

AAV-2 T C A 2913 

AAV-6 • 2918 

AAV-1 AGAGTCATCACCACCAGCACCCGCACCTGGGCCTTGCCCACCTACAATAACCACCTCTAC 2993 

AAV-2 A C C 2973 

AAV-6 A.. A T..C 2978 

AAV- 1 AAGC AAATCTCC AGTGCTTCAACGGGGGCC AGC AACG AC AACCACTACTTCGGCT ACAGC 3053 

AAV-2 ..A T CCAA ... ..A... TCG T T 3030 

AAV-6 3038 

AAV- 1 ACCCCCTGGGGGTATTTTGATTTC AAC AGATTCC ACTGCCACTTTTC ACCACGTGACTGG 3113 

AAV-2 T C s 3 °90 

AAV-6 T..C 3098 

AAV-1 CAGCGACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAACTCTTC 3173 

AAV-2 ..AA C A G T 3150 

AAV-6 G 3158 

AAV- 1 AACATCC AAGTCAAGG AGGTC ACGACG AATG ATGGCGTC ACAACCATCGCTAATAACCTT 3233 

AAV-2 T A CA C. .TACG. .G. .G. .T. .C 3210 

AAV-6 G 3218 
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FIG IE 

AAV-1 ACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTACGTCCTCGGCTCT 32 93 

AAV-2 G..G..TA.T • C. G 3270 

AAV-6 T.G 3278 

AAV-1 GCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGATTCCGCAATAGGGC 3353 

AAV-2 T..A..A G A. .A C G.G. .A. .G. .T. .A 3330 

AAV-6 G 3338 

AAV- 1 TACCTG ACGCTC AACAATGGCAGCCAAGCCGTGGGACGTTCATCCTTTTACTGCCTGGAA 3413 

AAV-2 C..C..G C..G..T. .G. .A. .A C. .T. .A G 3390 

AAV-6 A G. .A G 3398 

AAV-1 TATTTCCCTTCTCAGATGCTGAGAACGGGCAACAACTTTACCTTCAGCTACACCTTTGAG 3473 

AAV-2 ..C..T C.T..C..A T 3450 

AAV-6 A. .G T C... 3458 

AAV- 1 GAAGTGCCTTTCCACAGCAGCTACGCGCACAGCCAGAGCCTGGACCGGCTGATGAATCCT 3533 

AAV-2 ..C..T T T T..C 3510 

AAV-6 ..C 3498 

AAV-1 CTCATCGACCAATACCTGTATTACCTGAACAGAACTCAAAATCAGTCCGGAAGTGCCCAA 3593 

AAV-2 G T...G AA.C.C. .CAAGT CCA. .ACG 3570 

AAV-6 G - G 3578 

AAV-1 AACAAGGACTTGCTGTTTAGCCGTGGGTCTCCAGCTGGCATGTCTGTTCAGCCCAAAAAC 3 653 

AAV-2 C . GTCAAGGC . T . A TCT . AG . CCGGAG . GAG . .A. . .TCGG.AC. . .T.T.GG. . . 3630 

AAV-6 G 3638 

AAV-1 TGGCTACCTGGACCCTGTTATCGGCAGCAGCGCGTTTCTAAAACAAAAACAGACAACAAC 3713 

AAV-2 T C..C A . . A . CA . . G . . . TCTG . G . . T 3690 

AAV-6 C 3698 

AAV-1 AACAGCAATTTTACCTGGACTGGTGCTTCAAAATATAACCTCAATGGGCGTGAATCCATC 3773 

AAV-2 TG . A . ACT . G A. . . A.C. .G. .CC CA.A. .C. .TC.G 3750 

AAV-6 C T T..A 3758 

AAV- 1 ATCAACCCTGGC ACTGCT ATGGCCTCACAC AAAG ACG ACGAAGACAAGTTCTTTCCCATG 3833 

AAV-2 G.G..T..G..GC.C..C AAGC G T A T.....TCA. 3810 

AAV-6 ... . •••• A 3818 

AAV-1 AGCGGTGTCATGATTTTTGGAAAAGAGAGCGCCGGAGCTTCAAACACTGCATTGGACAAT 3893 

AAV-2 G..TC.C..C G . . GC . AG . . T . A . AGAAAA TGTGAACA . T . . A . . G 3870 

AAV-6 G 3878 

AAV-1 GTCATGATTACAGACGAAGAGGAAATTAAAGCCACTAACCCTGTGGCCACCGAAAGATTT 3953 
AAV-2 CGG.A.A. .C. .T. .C T. .G. .GCAG.A. 3930 

aav-6 ;!!;;".!!c!!!!!! c c... 393 s 

AAV- 1 GGGACCGTGGCAGTCAATTTCCAG AGC AGC AGC AC AGACCCTGCGACCGGAGATGTGC AT 4013 

AAV-2 . . TT . T . . AT . TAC . . . CC AG. . .A. .G.C.AG.A. .T C CA.C 3990 

AAV-6 T C 3998 

AAV-1 GCTATGGGAGCATTACCTGGCATGGTGTGGCAAGATAGAGACGTGTACCTGCAGGGTCCC 4073 

AAV-2 A.ACAA. .C.TTC.T. .A C G. .C T T G. . . 4050 

AAV-6 T C A C A T 4058 
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FIG IF 

AAV-1 ATTTGGGCCAAAATTCCTCACACAGATGGACACTTTCACCCGTCTCCTCTTATGGGCGGC 4133 

AAV- 2 ..C A. .G A G. .C T C C..C ..T..A 4110 

AAV-6 . G • -C 4118 

AAV-1 TTTGGACTCAAGAACCCGCCTCCTCAGATCCTCATCAAAAACACGCCTGTTCCTGCGAAT 4193 

AAV-2 ..C T..AC T A T G C..G..A 4170 

AAV-6 T...C 4178 

AAV-1 CCTCCGGCGGAGTTTTCAGCTACAAAGTTTGCTTCATTCATCACCCAATACTCCACAGGA 4253 

AAV-2 . . .T. .A.CACC. .CAGT. .GG C A. .G G... 4230 

AAV-6 A G G..T 4238 

AAV-1 CA-AGTGAGTGTGGAAATTGAATGGGAGCTGCAGAAAGAAAACAGCAAGCGCTGGAATCC 4312 

AAV-2 . .CG..C..C. G..C..G G A 4290 

AAV-6 ..- C G... A . ... 4297 

AAV-1 CGAAGTGCAGTACACATCCAATTATGCAAAATCTGCCAA-CGTTGATTTTACTGTGGACA 4371 

AAV-2 A.T T C..CAAC..G TT..T...G..C C T. 4350 

AAV-6 T T..C - C 4356 

AAV-1 ACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACCCGTTACCTTACCCGTCCCCTGT 44 31 

AAV-2 CT CG.G...T.A A. A G..T...AAT 4410 

AAV-6 • - C 4416 

VP1-3 stop Po lyA si gnal 
AAV-1 AATTACGTGTTAATCAATAAACCGGTTGATTCGTTTCAGTTGAACTTTGGTCTCCTGTCC 44 91 

AAV-2 ...G.T T..A TGCGTA 4470 

AAV-6 ....GT A G A G 4476 

AAV-1 TTCTTATCTTATC-GGTTACCATGGTTAT-AGCTTACACATTA — ACTGCTTGGTTGCGC 4547 

AAV-2 ..TC.T TA...T C. .CGTAGA. .AGT.GC.TGG.G.G. .AA.CATTA 4530 

AAV-6 ..A T...C A.CA.C-C.G ~ A 4533 

AAV-1 TTCGCGATAAAAGACTTACGTCATCGGGt tacccctagtgatggagttgcccact ccctc 4 607 

AAV-2 ACTA. A. gg. a 9 457 ? 

AAV-6 at. 4572 

AAV-1 tctgcgcgctcgctcgctcggtggggccggcagagcagagctctgccgtctgcggacctt 4667 

AAV-2 .c. ac.a gc. .c. .a. .g. .gc. . .a.gcvc.gg. . . 4630 

AAV-6 .a g 4632 

AAV-1 tggtccgcaggccccaccgagcgagcgagcgcgcagagagggagtgggcaa 4718 

AAV-2 . .ccg.gc 1. .gt c. . . 4 681 

AAV-6 ..--t 4 683 
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FIG. 5A 
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FIG. 6A 



FIG. 6B 
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SEQUENCE LISTING 



<110> Wilson, James M. 

f 

Xiao, Weidong 

The Trustees of the University of Pennsylvania 

<120> Adeno-Associated Virus Serotype I Nucleic Acid 

Sequences, Vectors and Host Cells Containing Same 



<130> GNVPN .031 PCT 



<140> 
<141> 

<150> 60/107, 114 
<151> 1998-11-05 



<160> 20 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 4718 
<212> DNA 
<213> AAV-1 



<220> 
<221> CDS 

<222> (335) . . (2206) 

<220> 
<221> CDS 

<222> (2223) . . (4430) 
<400> 1 

ttgcccactc cctctctgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc 60 

agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcag agagggagtg 120 

ggcaactcca tcactagggg taatcgcgaa gcgcctccca cgctgccgcg tcagcgctga 180 

cgtaaattac gtcatagggg agtggtcctg tattagctgt cacgtgagtg cttttgcgac 240 

attttgcgac accacgtggc catttagggt atatatggcc gagtgagcga gcaggatctc 300 

cattttgacc gcgaaatttg aacgagcagc agcc atg ccg ggc ttc tac gag ate 355 

Met Pro Gly Phe Tyr Glu lie 
1 5 



1 
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gtg ate aag gtg ccg age gac ctg gac gag cac ctg ccg ggc att tct 403 

Val lie Lys Val Pro Ser Asp Leu Asp Glu His Leu Pro Gly lie Ser 

10 15 20 

gac teg ttt gtg age tgg gtg gee. gag aag gaa tgg gag ctg ccc ccg 451 

Asp Ser Phe Val Ser Trp Val Ala Glu Lys Glu Trp Glu Leu Pro Pro 

25 30 35 

gat tct gac atg gat ctg aat ctg att gag cag gca ccc ctg ace gtg 499 

Asp Ser Asp Met Asp Leu Asn Leu lie Glu Gin Ala Pro Leu Thr Val 

40 45 50 55 

gec gag aag ctg cag cgc gac ttc ctg gtc caa tgg cgc cgc gtg agt 547 

Ala Glu Lys Leu Gin Arg Asp Phe Leu Val Gin Trp Arg Arg Val Ser 

60 65 70 



595 



aag gec ccg gag gec etc ttc ttt gtt cag ttc gag aag ggc gag tec 
Lys Ala Pro Glu Ala Leu Phe Phe Val Gin Phe Glu Lys Gly Glu Ser 
75 80 85 

tac ttc cac etc cat att ctg gtg gag acc acg ggg gtc aaa tec atg 643 
Tyr Phe His Leu His lie Leu Val Glu Thr Thr Gly Val Lys Ser Met 
90 95 100 



gtg ctg ggc cgc 
Val Leu Gly Arg 
105 

ate tac cgc ggg 
He Tyr Arg Gly 
120 

aag acg cgt aat 
Lys Thr Arg Asn 

tac ate ccc aac 
Tyr He Pro Asn 
155 

gcg tgg act aac 
Ala Trp Thr Asn 
170 



ttc ctg agt 
Phe Leu Ser 
110 

ate gag ccg 
He Glu Pro 
125 

ggc gec gga 
Gly Ala Gly 
140 

tac etc ctg 
Tyr Leu Leu 

atg gag gag 
Met Glu Glu 



cag att agg gac 
Gin He Arg Asp 

acc ctg ccc aac 
Thr Leu Pro Asn 
130 

999 999 aac aa 9 
Gly Gly Asn Lys 
145 

ccc aag act cag 
Pro Lys Thr Gin 
160 

tat ata age gee 
Tyr He Ser Ala 
175 



aag ctg gtg 
Lys Leu Val 
115 

tgg ttc gcg 
Trp Phe Ala 



gtg gtg gac 
Val Val Asp 



ccc gag ctg 
Pro Glu Leu 
165 

tgt ttg aac 
Cys Leu Asn 
180 



cag acc 691 
Gin Thr 



gtg acc 739 
Val Thr 
135 

gag tgc 787 

Glu Cys 

150 

cag tgg 835 
Gin Trp 

ctg gee 883 
Leu Ala 



gag cgc aaa egg etc gtg gcg cag cac ctg acc cac gtc age cag acc 
Glu Arg Lys Arg Leu Val Ala Gin His Leu Thr His Val Ser Gin Thr 
185 190 195 



931 
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cag gag cag 
Gin Glu Gin 
200 

ate egg tea 
lie Arg Ser 



gtg gac egg 
Val Asp Arg 



gee teg tac 
Ala Ser Tyr 
250 

aag gee get 
Lys Ala Ala 
265 

gcg ccc gac 
Ala Pro Asp 
280 

aac cgc ate 
Asn Arg He 

gee ggc tec 
Ala Gly Ser 

aac ace ate 
Asn Thr He 
330 

gcg gaa gec 
Ala Glu Ala 
345 

acc aat gag 
Thr Asn Glu 
360 

tgg tgg gag 
Trp Trp Glu 



aac aag gag 
Asn Lys Glu 
205 

aaa acc tec 
Lys Thr Ser 
220 

ggc ate acc 
Gly He Thr 
235 

ate tec ttc 
He Ser Phe 



ctg gac aat 
Leu Asp Asn 

tac ctg gta 
Tyr Leu Val 
285 

tac cgc ate 
Tyr Arg He 
300 

gtc ttt etc 
Val Phe Leu 
315 

tgg ctg ttt 
Trp Leu Phe 



ate gee cac 
He Ala His 



aac ttt ccc 
Asn Phe Pro 
365 

gag ggc aag 
Glu Gly Lys 
380 



aat ctg aac 
Asn Leu Asn 



gcg cgc tac 
Ala Arg Tyr 



tec gag aag 
Ser Glu Lys 
240 

aac gec get 
Asn Ala Ala 
255 

gec ggc aag 
Ala Gly Lys 
270 

ggc ccc get 
Gly Pro Ala 



ctg gag ctg 
Leu Glu Leu 



ggc tgg gec 
Gly Trp Ala 
320 

ggg ccg gee 
Gly Pro Ala 
335 

gee gtg ccc 
Ala Val Pro 
350 

ttc aat gat 
Phe Asn Asp 



atg acg gec 
Met Thr Ala 



ccc aat tct 
Pro Asn Ser 
210 

atg gag ctg 
Met Glu Leu 

225 

cag tgg ate 
Gin Trp He 

tec aac teg 
Ser Asn Ser 



ate atg gcg 
He Met Ala 
275 

ccg ccc gcg 
Pro Pro Ala 
290 

aac ggc tac 
Asn Gly Tyr 
305 

cag aaa agg 
Gin Lys Arg 



acc acg ggc 
Thr Thr Gly 



ttc tac ggc 
Phe Tyr Gly 
355 

tgc gtc gac 
Cys Val Asp 
370 

aag gtc gtg 
Lys Val Val 
385 



gac gcg cct 
Asp Ala Pro 

gtc ggg tgg 
Val Gly Trp 
230 

cag gag gac 
Gin Glu Asp 
245 

egg tec cag 
Arg Ser Gin 
260 

ctg acc aaa 
Leu Thr Lys 

gac att aaa 
Asp lie Lys 

gaa cct gee 
Glu Pro Ala 
310 

ttc ggg aag 
Phe Gly Lys 
325 

aag ace aac 
Lys Thr Asn 
340 

tgc gtc aac 
Cys Val Asn 



aag atg gtg 
Lys Met Val 



gag tec gec 
Glu Ser Ala 
390 



gtc 979 

Val 

215 

ctg 1027 
Leu 



cag 1075 
Gin 



ate 1123 
He 



tec 1171 
Ser 



acc 1219 

Thr 

295 

tac 1267 
Tyr 

cgc 1315 
Arg 



ate 1363 
He 



tgg 1411 
Trp 



ate 1459 

He 

375 

aag 1507 
Lys 



3 
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gcc att etc 
Ala lie Leu 



tec gcc cag 
Ser Ala Gin 
410 

atg tgc gcc 
Met Cys Ala 
425 

ccg ttg cag 
Pro Leu Gin 
440 

cat gac ttt 
His Asp Phe 



tgg gcg cag 
Trp Ala Gin 



aag ggt gga 
Lys Gly Gly 
490 

gag ccc aag 
Glu Pro Lys 
505 

gcg gaa gga 
Ala Glu Gly 
520 

tct cgt cac 
Ser Arg His 



gag aga atg 
Glu Arg Met 



gac tgt tea 
Asp Cys Ser 
570 



ggc ggc age 
Gly Gly Ser 
395 

ate gac ccc 
lie Asp Pro 



gtg att gac 
Val lie Asp 



gac egg atg 
Asp Arg Met 
445 

ggc aag gtg 
Gly Lys Val 
4 60 

gat cac gtg 
Asp His Val 
475 

gcc aac aaa 
Ala Asn Lys 



egg gcc tgc 
Arg Ala Cys 



get ccg gtg 
Ala Pro Val 
525 

gcg ggc atg 
Ala Gly Met 
540 

aat cag aat 
Asn Gin Asn 
555 

gag tgc ttc 
Glu Cys Phe 



aag gtg cgc 
Lys Val Arg 
400 

ace ccc gtg 
Thr Pro Val 
415 

ggg aac age 
Gly Asn Ser 
430 

ttc aaa ttt 
Phe Lys Phe 



aca aag cag 
Thr Lys Gin 



acc/ gag gtg 
Thr Glu Val 
480 

aga ccc gcc 
Arg Pro Ala 
495 

ccc tea gtc 
Pro Ser Val 
510 

gac ttt gcc 
Asp Phe Ala 



ctt cag atg 
Leu Gin Met 



ttc aac att 
Phe Asn lie 
560 

ccc ggc gtg 
Pro Gly Val 
575 



gtg gac caa 
Val Asp Gin 



ate gtc acc 
He Val Thr 



acc acc ttc 
Thr Thr Phe 
435 

gaa etc acc 
Glu Leu Thr 
450 

gaa gtc aaa 
Glu Val Lys 
465 

gcg cat gag 
Ala His Glu 



ccc gat gac 
Pro Asp Asp 



gcg gat cca 
Ala Asp Pro 
515 

gac agg tac 
Asp Arg Tyr 
530 

ctg ttt ccc 
Leu Phe Pro 
545 

tgc ttc acg 
Cys Phe Thr 



tea gaa tct 
Ser Glu Ser 



aag tgc aag 
Lys Cys Lys 
405 

tec aac acc 
Ser Asn Thr 
420 

gag cac cag 
Glu His Gin 



cgc cgt ctg 
Arg Arg Leu 



gag ttc ttc 
Glu Phe Phe 
470 

ttc tac gtc 
Phe Tyr Val 
485 

gcg gat aaa 
Ala Asp Lys 
500 

teg acg tea 
Ser Thr Ser 



caa aac aaa 
Gin Asn Lys 

tgc aag aca 
Cys Lys Thr 
550 

cac ggg acg 
His Gly Thr 
565 

caa ccg gtc 
Gin Pro Val 
580 



teg 1555 
Ser 



aac -1603. 
Asn 



cag 1651 
Gin 



gag 1699 

Glu 

455 

cgc 1747 
Arg 



aga 1795 
Arg 



age 1843 
Ser 



gac 1891 
Asp 



tgt 1939 

Cys 

535 

tgc 1987 
Cys 

aga 2035 
Arg 



gtc 2083 
Val 
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aga aag agg acg tat egg aaa etc tgt gee att cat cat ctg ctg ggg 2131 

Arg Lys Arg Thr Tyr Arg Lys Leu Cys Ala lie His His Leu Leu 4 Gly 
585 590 595 

egg get ccc gag att get tgc teg gec tgc gat ctg gtc aac gtg gac 2179 

Arg Ala Pro Glu lie Ala Cys Ser Ala Cys Asp Leu Val Asn Val Asp 
600 605 610 615 



ctg gat gac tgt gtt tct gag caa taa atgacttaaa ccaggt atg get gee 2231 
Leu Asp Asp Cys Val Ser Glu Gin Met Ala Ala 

620 625 



gat ggt tat ctt cca gat tgg etc gag gac aac etc tct gag ggc att 2279 
Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu Gly lie 
630 635 640 



cgc gag tgg tgg gac ttg aaa cct gga gee ccg aag ccc aaa gec aac 2327 

Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro Lys Ala Asn 

645 650 655 

cag caa aag cag gac gac ggc egg ggt ctg gtg ctt cct ggc tac aag 2375 

Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro Gly Tyr Lys 

660 665 670 675 

tac etc gga ccc ttc aac gga etc gac aag ggg gag ccc gtc aac gcg 2423 

Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro Val Asn Ala 

680 685 690 



gcg gac gca gcg gee etc gag cac gac aag gec tac gac cag cag etc 2471 
Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin Gin Leu 
695 700 705 



aaa gcg ggt gac aat ccg tac ctg egg tat aac cac gec gac gee gag 2519 

Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala Asp Ala Glu 

710 715 720 

ttt cag gag cgt ctg caa gaa gat acg tct ttt ggg ggc aac etc ggg 2567 

Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly Asn Leu Gly 
725 730 735 

cga gca gtc ttc cag gee aag aag egg gtt etc gaa cct etc ggt ctg 2615 

Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Leu Gly Leu 
740 745 750 755 

gtt gag gaa ggc get aag acg get cct gga aag aaa cgt ccg gta gag 2663 

Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg Pro Val Glu 

760 765 770 
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cag teg cca caa gag cca gac tec tec teg ggc ate ggc aag aca ggc 2711 

Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys Thr Gly 

775 780 785 

cag cag ccc get aaa aag aga etc aat ttt ggt cag act ggc gac tea 2759 

Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser 

790 795 800 

gag tea gtc ccc gat cca caa cct etc gga gaa cct cca gca acc ccc 2807 

Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Thr Pro 

805 810 815 

get get gtg gga cct act aca atg get tea ggc ggt ggc gca cca atg 2855 

Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly Ala Pro Met 

820 825 830 835 

gca gac aat aac gaa ggc gee gac gga gtg ggt aat gee tea gga aat 2903 

Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn 
840 845 850 

tgg cat tgc gat tec aca tgg ctg ggc gac aga gtc ate acc acc age 2951 

Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser 

855 860 865 

acc cgc acc tgg gec ttg ccc acc tac aat aac cac etc tac aag caa 2999 

Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin 

870 875 880 

ate tec agt get tea acg ggg gee age aac gac aac cac tac ttc ggc 3047 

lie Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly 

885 890 895 

tac age acc ccc tgg ggg tat ttt gat ttc aac aga ttc cac tgc cac 3095 

Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 

900 905 910 915 

ttt tea cca cgt gac tgg cag cga etc ate aac aac aat tgg gga ttc 3143 

Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe 
920 925 930 

egg ccc aag aga etc aac ttc aaa etc ttc aac ate caa gtc aag gag 3191 

Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu 

935 940 945 

gtc acg acg aat gat ggc gtc aca acc ate get aat aac ctt acc age 3239 

Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser 

950 955 960 
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acg gtt caa gtc ttc teg gac teg gag tac cag ctt ccg tac gtc etc 3287 
Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
965 970 975 

ggc tct gcg cac cag ggc tgc etc cct ccg ttc ccg gcg gac gtg ttc 3335 
Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
S80 985 990 995 

atg att ccg caa tac. ggc tac ctg acg etc aac aat ggc age caa gee 3383 
Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
1000 1005 1010 

gtg gga cgt tea tec ttt tac tgc ctg gaa tat ttc cct tct cag atg 3431 
Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
1015 1020 1025 

ctg aga acg ggc aac aac ttt ace ttc age tac ace ttt gag gaa gtg 3479 
Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu Glu Val 
1030 1035 1040 

cct ttc cac age age tac gcg cac age cag age ctg gac egg ctg atg 3527 
Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
1045 1050 1055 

aat cct etc ate gac caa tac ctg tat tac ctg aac aga act caa aat 3575 
Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr Gin Asn 
1060 1065 1070 1075 

cag tec gga agt gec caa aac aag gac ttg ctg ttt age cgt ggg tct 3623 
Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser Arg Gly Ser 
1080 1085 1090 

cca get ggc atg tct gtt cag ccc aaa aac tgg eta cct gga ccc tgt 3671 
Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro Gly Pro Cys 
1095 1100 1105 

tat egg cag cag cgc gtt tct aaa aca aaa aca gac aac aac aac age 3719 
Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser 
1110 1115 1120 

aat ttt acc tgg act ggt get tea aaa tat aac etc aat ggg cgt gaa 3767 
Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn Gly Arg Glu 
1125 1130 1135 

tec ate ate aac cct ggc act get atg gee tea cac aaa gac gac gaa 3815 
Ser He He Asn Pro Gly Thr Ala Met Ala Ser His Lys Asp Asp Glu 
1140 1145 1150 1155 
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gac aag ttc ttt ccc atg age ggt gtc atg att ttt gga aaa gag age 3863 
Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly Lys Glu Ser 
1160 1165 1170 

gec gga get tea aac act gca ttg gac aat gtc atg att aca gac gaa 3911 
Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met lie Thr Asp Glu 
1175 1180 1185 

gag gaa att aaa gec act aac cct gtg gec acc gaa aga ttt ggg acc 3959 
Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg Phe Gly Thr 
1190 1195 1200 

gtg gca gtc aat ttc cag age age age aca gac cct gcg acc gga gat 4007 
Val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp 
1205 1210 1215 

gtg cat get atg gga gca tta cct ggc atg gtg tgg caa gat aga gac 4055 
Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin Asp Arg Asp 
1220 1225 1230 1235 

gtg tac ctg cag ggt ccc att tgg gee aaa att cct cac aca gat gga 4103 
Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
1240 1245 1250 

cac ttt cac ccg tct cct ctt atg ggc ggc ttt gga etc aag aac ccg 4151 
His Phe His Pre Ser Pro Leu Met Gly Gly Phe Gly Leu Lys Asn Pro 
1255 1260 1265 

cct cct cag ate etc ate aaa aac acg cct gtt cct gcg aat cct ccg 4199 
Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro 
1270 1275 1280 

gcg gag ttt tea get aca aag ttt get tea ttc ate acc caa tac tec 4247 
Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser 
1285 1290 1295 

aca gga caa gtg agt gtg gaa att gaa tgg gag ctg cag aaa gaa aac 4295 
Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
1300 1305 1310 1315 

age aag cgc tgg aat ccc .gaa gtg cag tac aca tec aat tat gca aaa 4343 
Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn Tyr Ala Lys 
1320 1325 1330 

tct gee aac gtt gat ttt act gtg gac aac aat gga ctt tat act gag 4391 
Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu 
1335 1340 1345 
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cct cgc ccc att ggc acc cgt tac ctt acc cgt ccc ctg taattacgtg 4440 

Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
1350 1355 1360 

ttaatcaata aaccggttga ttcgtttcag ttgaactttg gtctcctgtc cttcttatct 4500 

tatcggttac catggttata gcttacacat taactgcttg gttgcgcttc gcgataaaag 4560 

acttacgtca tcgggttacc cctagtgatg gagttgccca ctccctctct gcgcgctcgc 4620 

tcgctcggtg gggcctgcgg accaaaggtc cgcagacggc agagctctgc tctgccggcc 4680 

ccaccgagcg agcgagcgcg cagagaggga gtgggcaa 4718 



<210> 2 
<211> 623 
<212> PRT 
<213> AAV-1 

<400> 2 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 

Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Ser Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 

Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His lie Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 
100 105 110 

Arg Asp Lys Leu Val Gin Thr lie Tyr Arg Gly lie Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 
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Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys' 
145 150 155 . 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 

Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly He Thr Ser Glu Lys 
225 230 235 240 

Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245. 250 255 

Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

■lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 
275 280 285 

Pro Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 



Asn Gly Tyr Glu Pro 
305 

Gin Lys Arg Phe Gly 
325 

Thr Thr Gly Lys Thr 
340 

Phe Tyr Gly Cys Val 
355 

Cys Val Asp Lys Met 
370 

Lys Val Val Glu Ser 
385 



Ala Tyr Ala Gly Ser Val 
310 315 

Lys Arg Asn Thr He Trp 
330 

Asn He Ala Glu Ala He 
345 

Asn Trp Thr Asn Glu Asn 
360 

Val He Trp Trp Glu Glu 
375 

Ala Lys Ala He Leu Gly 
3S0 395 



Phe Leu Gly Trp Ala 
320 

Leu Phe Gly Pro Ala 
335 

Ala His Ala Val Pro 
350 

Phe Pro Phe Asn Asp 
365 

Gly Lys Met Thr Ala 
380 

Gly Ser Lys Val Arg 
400 
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Val Asp Gin Lys Cys 
405 

He Val Thr Ser Asn 
420 

Thr Thr Phe Glu His 
435 

Glu Leu Thr Arg Arg 
450 

Glu Val Lys Glu Phe 
465 

Ala His Glu Phe Tyr 
485 

Pro Asp Asp Ala Asp 
500 

Ala Asp Pro Ser Thr 
515 

Asp Arg Tyr Gin Asn 
530 

Leu Phe Pro Cys Lys 
545 

Cys Phe Thr His Gly 
565 

Ser Glu Ser Gin Pro 
580 

Ala He His His Leu 
595 

Cys Asp Leu Val Asn 
610 



Lys Ser Ser Ala Gin He 
410 

Thr Asn Met Cys Ala Val 
425 

Gin Gin Pro Leu Gin Asp 
440 

Leu Glu His Asp Phe Gly 
455 

Phe Arg Trp Ala Gin Asp 
470 475 

Val Arg Lys Gly Gly Ala 
490 

Lys Ser Glu Pro Lys Arg 
505 

Ser Asp Ala Glu Gly Ala 
520 

Lys Cys Ser Arg His Ala 
535 

Thr Cys Glu Arg Met Asn 
550 555 

Thr Arg Asp Cys Ser Glu 
570 

Val Val Arg Lys Arg Thr 
585 

Leu Gly Arg Ala Pro Glu 
600 

Val Asp Leu Asp Asp Cys 
615 



Asp Pro Thr Pro Val 
415 

He Asp Gly Asn Ser 
430 

Arg Met Phe Lys Phe 
445 

Lys Val Thr Lys Gin 
460 

His Val Thr Glu Val 
480 

Asn Lys Arg Pro Ala 
495 

Ala Cys Pro Ser Val 
510 

Pro Val Asp Phe Ala 
525 

Gly Met Leu Gin Met 
540 

Gin Asn Phe Asn He 
560 

Cys Phe Pro Gly Val 
575 

Tyr Arg Lys Leu Cys 
590 

He Ala Cys Ser Ala 
605 

Val Ser Glu Gin 
620 



<210> 3 
<211> 736 
<212> PRT 
<213> AAV-1 
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<400> 3 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 

Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
225 230 235 240 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
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245 250 255 



Tyr Lys Gin lie Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 
290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 

305 310 315 320 

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 

Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 

Asp Val Phe Met lie Pre Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 

Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 



Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 

Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 

Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
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510 



Gly Arg Glu Ser lie 
515 

Asp Asp Glu Asp Lys 
530 

Lys Glu Ser Ala Gly 
545 

Thr Asp Glu Glu Glu 
565 

Phe Gly Thr Val Ala 
580 

Thr Gly Asp Val His 
595 

Asp Arg Asp Val Tyr 
610 

Thr Asp Gly His Phe 
625 

Lys Asn Pro Pro Pro 
645 

Asn Pro Pro Ala Glu 
660 

Gin Tyr Ser Thr Gly 
675 

Lys Glu Asn Ser Lys 
690 

Tyr Ala Lys Ser Ala 
705 

Tyr Thr Glu Pro Arg 
725 



lie Asn Pro Gly Thr 
520 

Phe Phe Pro Met Ser 
535 

Ala Ser Asn Thr Ala 
550 

lie Lys Ala Thr Asn 
570 

Val Asn Phe Gin Ser 
585 

Ala Met Gly Ala Leu 
600 

Leu Gin Gly Pro lie 
615 

His Pro Ser Pro Leu 
630 

Gin lie Leu lie Lys 
650 

Phe Ser Ala Thr Lys 
665 

Gin Val Ser Val Glu 
680 

Arg Trp Asn Pro Glu 
695 

Asn Val Asp Phe Thr 
710 

Pro lie Gly Thr Arg 
730 



Ala Met Ala Ser His Lys 
525 

Gly Val Met He Phe Gly 
540 

Leu Asp Asn Val Met He 
555 560 

Pro Val Ala Thr Glu Arg 
575 

Ser Ser Thr Asp Pro Ala 
590 

Pro Gly Met Val Trp Gin 
605 

Trp Ala Lys He Pro His 
620 

Met Gly Gly Phe Gly Leu 
635 640 

Asn Thr Pro Val Pro Ala 
655 

Phe Ala Ser Phe He Thr 
670 

He Glu Trp Glu Leu Gin 
685 

Val Gin Tyr Thr Ser Asn 
700 

Val Asp Asn Asn Gly Leu 
715 720 

Tyr Leu Thr Arg Pro Leu 
735 



<210> 4 
<21i> 1872 
<212> DNA 
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<2-13> AAV-1 

<220> 

<221> CDS 

<222> (1) . . (1869) 

<400> 4 

atg ccg ggc ttc tac gag ate gtg ate aag gtg ccg age gac ctg gac 48 

Met Pro Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 
15 10 15 

gag cac ctg ccg ggc att tct gac teg ttt gtg age tgg gtg gec gag 96 
Glu His Leu Pro Gly He Ser Asp Ser Phe Val Ser Trp Val Ala Glu 
20 25 30 

aag gaa tgg gag ctg ccc ccg gat tct gac atg gat ctg aat ctg att 144 
Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 
35 40 45 

gag cag gca ccc ctg acc gtg gec gag aag ctg cag cgc gac ttc ctg 192 
Glu Gin Ala Pro Leu.. 'Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 

gtc caa tgg cgc cgc gtg agt aag gee ccg gag gec etc ttc ttt gtt 240, 
Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

cag ttc gag aag ggc gag tec tac ttc cac etc cat att ctg gtg gag 288 
Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His He Leu Val Glu 
85 90 95 

acc acg ggg gtc aaa tec atg gtg ctg ggc cgc ttc ctg agt cag att 336 
Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He 
100 105 110 

agg gac aag ctg gtg cag acc ate tac cgc ggg ate gag ccg acc ctg 384 
Arg Asp Lys Leu Val Gin Thr He Tyr Arg Gly He Glu Pro Thr Leu 
115 120 125 

ccc aac tgg ttc gcg gtg acc aag acg cgt aat ggc gec gga ggg ggg 432 
Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

aac aag gtg gtg gac gag tgc tac ate ccc aac tac etc ctg ccc aag 480 
Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

act cag ccc gag ctg cag tgg gcg tgg act aac atg gag gag tat ata 528 



15 



WO 00/28061 



PCIYUS99/25694 



Thx Gin Pro 



age gec tgt 
Ser Ala Cys 

ctg acc cac 
Leu Thr His 
195 

ccc aat tct 
Pro Asn Ser 
210 

atg gag ctg 
Met Glu Leu 
225 

cag tgg ate 
Gin Trp lie 



tec aac teg 
Ser Asn Ser 



ate atg gcg 
lie Met Ala 
275 

ccg ccc gcg 
Pro Pro Ala 
290 

aac ggc tac 
Asn Gly Tyr 
305 

cag aaa agg 
Gin Lys Arg 



acc acg ggc 
Thr Thr Gly 



ttc tac ggc 



Glu Leu Gin 
165 

ttg aac ctg 
Leu Asn Leu 
180 

gtc age cag 
Val Ser Gin 



gac gcg cct 
Asp Ala Pro 



gtc ggg tgg 
Val Gly Trp 
230 

cag gag gac 
Gin Glu Asp 
245 

egg tec cag 
Arg Ser Gin 
260 

ctg acc aaa 
Leu Thr Lys 

gac att aaa 
Asp lie Lys 

gaa cct gec 
Glu Pro Ala 
310 

ttc ggg aag 
Phe Gly Lys 
325 

aag acc aac 
Lys Thr Asn 
340 

tgc gtc aac 



Trp Ala Trp 



gee gag cgc 
Ala Glu Arg 
185 

acc caq gag 
Thr Gin Glu 
200 

gtc ate egg 
Val lie Arg 
215 

ctg gtg gac 
Leu Val Asp 



cag gee teg 
Gin Ala Ser 



ate aag gee 
lie Lys Ala 
265 

tec gcg ccc 
Ser Ala Pro 
280 

acc aac cgc 
Thr Asn Arg 
295 

tac gee ggc 
Tyr Ala Gly 



cgc aac acc 
Arg Asn Thr 



ate gcg gaa 
He Ala Glu 
345 

tgg acc aat 



Thr Asn Met 
170 

aaa egg etc 

Lys Arg Leu 

cag aac aag 
Gin Asn Lys 

tea aaa acc 
Ser Lys Thr 
220 

egg ggc ate 
Arg Gly He 
235 

tac ate tec 

Tyr He Ser 
250 

get ctg gac 

Ala Leu Asp 

gac tac ctg 
Asp Tyr Leu 

ate tac cgc 
He Tyr Arg 
300 

tec gtc ttt 
Ser Val Phe 
315 

ate tgg ctg 
He Trp Leu 
330 

gee ate gee 
Ala He Ala 



gag aac ttt 



Glu Glu Tyr 
175 

gtg gcg cag 
Val Ala Gin 
190 

gag aat ctg 
Glu Asn Leu 
205 

tec gcg cgc 
Ser Ala Arg 



acc tec gag 
Thr Ser Glu 



ttc aac gee 
Phe Asn Ala 
255 

aat gec ggc 
Asn Ala Gly 
270 

gta ggc ccc 
Val Gly Pro 
285 

ate ctg gag 
He Leu Glu 



etc ggc tgg 
Leu Gly Trp 



ttt ggg ccg 
Phe Gly Pro 
335 

cac gee gtg 
His Ala Val 
350 

ccc ttc aat 



He 



cac 576 
His 



aac 624 
Asn 



tac 672 
Tyr 



aag 720 

Lys 

240 

get 768 
Ala 



aag 816 
Lys 



get 864 
Ala 



ctg 912 
Leu 



gec 960 

Ala 

320 

gec 1008 
Ala 



ccc 1056 
Pro 



gat 1104 
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Pbe Tyr Gly 
355 

tgc gtc gac 
Cys Val Asp 
370 

aag gtc gtg 
Lys Val Val 
385 

gtg gac caa 
Val Asp Gin 



ate gtc acc 
lie Val Thr 



acc acc ttc 
Thr Thr Phe 
435 

gaa etc acc 
Glu Leu Thr 
450 

gaa gtc aaa 
Glu Val Lys 
465 

gcg cat gag 
Ala His Glu 



ccc gat gac 
Pro Asp Asp 



gcg gat cca 
Ala Asp Pro 
515 

gac agg tac 
Asp Arg Tyr 
530 

ctg ttt ccc 



Cys Val Asn 



aag atg gtg 
Lys Met Val 



gag tec gee 
Glu Ser Ala 
390 

aag tgc aag 
Lys Cys Lys 
405 

tec aac acc 

Ser Asn Thr 
420 

gag cac cag 

Glu His Gin 



cgc cgt ctg 
Arg Arg Leu 



gag ttc ttc 
Glu Phe Phe 
470 

ttc tac gtc 
Phe Tyr Val 
485 

gcg gat aaa 
Ala Asp Lys 
500 

teg acg tea 
Ser Thr Ser 



caa aac aaa 
Gin Asn Lys 

tgc aag aca 



Trp Thr Asn 
360 

ate tgg tgg 
lie Trp Trp 
375 

aag gec att 
Lys Ala lie 

teg tec gec 
Ser Ser Ala 



aac atg tgc 
Asn Met Cys 
425 

cag ccg ttg 
Gin Pro Leu 
440 

gag cat gac 
Glu His Asp 
455 

cgc tgg gcg 
Arg Trp Ala 



aga aag ggt 
Arg Lys Gly 



age gag ccc 
Ser Glu Pro 
505 

gac gcg gaa 
Asp Ala Glu 
520 

tgt tct cgt 
Cys Ser Arg 
535 

tgc gag aga 



Glu Asn Phe 



gag gag ggc 
Glu Glu Gly 
380 

etc ggc ggc 
Leu Gly Gly 
3S5 

cag ate gac 
Gin lie Asp 
410 

gee gtg att 
Ala Val He 



cag gac egg 
Gin Asp Arg 



ttt ggc aag 
Phe Gly Lys 
460 

cag gat cac 
Gin Asp His 
475 

gga gee aac 
Gly Ala Asn 
490 

aag egg gec 
Lys Arg Ala 



gga get ccg 
Gly Ala Pro 



cac gcg ggc 
His Ala Gly 
540 

atg aat cag 



Pro Phe Asn 
365 

aag atg acg 
Lys Met Thr 



age aag gtg 
Ser Lys Val 



ccc acc ccc 
Pro Thr Pro 
415 

gac ggg aac 
Asp Gly Asn 
430 

atg ttc aaa 
Met Phe Lys 
445 

gtg aca aag 
Val Thr Lys 



gtg acc gag 
Val Thr Glu 



aaa aga ccc 
Lys Arg Pro 
495 

tgc ccc tea 
Cys Pro Ser 
510 

gtg gac ttt 
Val Asp Phe 
525 

atg ctt cag 
Met Leu Gin 



aat* ttc aac 



Asp 



gec 1152 
Ala 

cgc 1200 

Arg 

400 

gtg 1248 
Val 



age 1296 
Ser 



ttt 1344 
Phe 



cag 1392 
Gin 



gtg 1440 

Val 

480 

gec 1488 
Ala 



gtc 1536 
Val 



gec 1584 
Ala 



atg 1632 
Met 



att 1680 
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Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn lie 
545 550 555 560 

tgc ttc acg cac ggg acg aga gac tgt tea gag tgc ttc ccc ggo gtg 1728 
Cys Phe Thr His Gly Thr Arg Asp Cys Ser Glu Cys Phe Pro Gly Val 
565 570 575 

tea gaa tct caa ccg gtc gtc aga aag agg acg tat egg aaa etc tgt 1776 
Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys Leu Cys 
580 585 590 

gee att cat cat ctg ctg ggg egg get ccc gag att get tgc teg gee 1824 
Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys Ser Ala 
595 600 605 

tgc gat ctg gtc aac gtg gac ctg gat gac tgt gtt tct gag caa taa 1872 
Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
610 615 620 



<210> 5 
<211> 623 
<212> PRT 
<213> AAV-1 

<400> 5 

Met Pro Gly Phe 
1 

Glu His Leu Pro 
20 

Lys Glu Trp Glu 
35 

Glu Gin Ala Pro 
50 

Val Gin Trp Arg 
65 

Gin Phe Glu Lys 



Thr Thr Gly Val 
100 

Arg Asp Lys Leu 



Tyr Glu He Val 
5 

Gly He Ser Asp 



Leu Pro Pro Asp 
40 

Leu Thr Val Ala 
55 

Arg Val Ser Lys 
70 

Gly Glu Ser Tyr 
85 

Lys Ser Met Val 



Val Gin Thr He 



He Lys Val Pro 
10 

Ser Phe Val Ser 
25 

Ser Asp Met Asp 



Glu Lys Leu Gin 
60 

Ala Pro Glu Ala 
75 

Phe His Leu His 
90 

Leu Gly Arg Phe 
105 

Tyr Arg Gly He 



Ser Asp Leu Asp 
15 

Trp Val Ala Glu 
30 

Leu Asn Leu He 
45 

Arg Asp Phe Leu 



Leu Phe Phe Val 
80 

He Leu Val Glu 
95 

Leu Ser Gin He 
110 

Glu Pro Thr Leu 
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115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 
165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 

Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly He Thr Ser Glu Lys 
225 230 235 240 

Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 
275 280 285 

Pro Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 

Asn Gly Tyr Glu Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315. 320 

Gin Lys Arg Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe ASn Asp 
355 360 365 

Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 

1 c. 
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Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys, Val Arg 
385 390 395 400 

Val Asp Gin Lys Cys Lys Ser Ser, Ala Gin He Asp Pro Thr Pro Val 
405 410 415 

He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 

Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu Val 
465 470 475 480 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 
485 " 490 495 

Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 510 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu Gin Met 
530 535 540 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 
545 550 555 560 

Cys Phe Thr His Gly Thr Arg Asp Cys Ser Glu Cys Phe Pro Gly Val 
.565 570 575 

Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys Leu Cys 
580 585 590 

Ala He His His Leu Leu Gly Arg Ala Pro Glu He Ala Cys Ser Ala 
595 600 605 



Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
610 615 620 
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<Z10> 6 
<211> 1641 
<212> DNA 
<213> AAV-1 

<220> 

<221> CDS 

<222> (1) . . (1638) 

<400> 6 

atg ccg ggc ttc tac gag ate gtg ate aag gtg ccg age gac ctg gac 48 

Met Pro Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 

1.5 10 15 

gag cac ctg ccg ggc att tct gac teg ttt gtg age tgg gtg gee gag 96 
Glu His Leu Pro Gly He Ser Asp Ser Phe Val Ser Trp Val Ala Glu 
20 25 30 

aag gaa tgg gag ctg ccc ccg gac tct gac atg gat ctg aat ctg att 144 
Lys Glu Trp Giu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 
35 40 45 

gag cag gca ccc ctg acc gtg gee gag aag ctg cag cgc gac ttc ctg 192 
Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 

gtc caa tgg cgc cgc gtg agt aag gee ccg gag gee etc ttc ttt gtt 240 
Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

cag ttc gag aag ggc gag tec tac ttc cac etc cat att ctg gtg gag 288 
Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His He Leu Val Glu 
85 90 95 

acc acg ggg gtc aaa tec atg gtg ctg ggc cgc ttc ctg agt cag att 336 
Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He 
100 105 HO 

agg gac aag ctg gtg cag acc ate tac cgc ggg ate gag ccg acc ctg 384 
Arg Asp Lys Leu Val Gin Thr He Tyr Arg Gly He Glu Pro Thr Leu 
115 120 125 

ccc aac tgg ttc gcg gtg acc aag acg cgt aat ggc gee gga ggg ggg 432 
Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

aac aag gtg gtg gac gag tgc tac ate ccc aac tac etc ctg ccc aag 480 
Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
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145- 150 155 160 

act cag ccc gag ctg cag tgg gcg tgg act aac atg gag gag tat ata 528 
Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 

age gec tgt ttg aac ctg gec gag cgc aaa egg etc gtg gcg cag cac 576 
Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 

ctg acc cac gtc age cag acc cag gag cag aac aag gag aat ctg aac 624 
Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 

ccc aat tct gac gcg cct gtc ate egg tea aaa acc tec gcg cgc tac 672 
Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

atg gag ctg gtc ggg egg ctg gtg gac egg ggc ate acc tec gag aag 720 
Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly He Thr Ser Glu Lys 
225 230 235 240 

cag tgg ate cag gag gac cag gec teg tac ate tec ttc aac gee get 768 
Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245 250 255 

tec aac teg egg tec cag ate aag gee get ctg gac aat gee ggc aag 816 
Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

ate atg gcg ctg acc aaa tec gcg ccc gac tac ctg gta ggc ccc get 864 
He Met Ala Leu Thr Lys Ser Ala ' Pro Asp Tyr Leu Val Gly Pro Ala 
275 280 285 

ccg ccc gcg gac att aaa acc aac cgc ate tac cgc ate ctg gag ctg 912 
Pro Pro Ala Asp lie Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 

aac ggc tac gaa cct gec tac gec ggc tec gtc ttt etc ggc tgg gec 960 
Asn Gly Tyr Glu Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

cag aaa agg ttc ggg aag cgc aac acc ate tgg ctg ttt ggg ccg gec 1008 
Gin Lys Arg Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 335 

acc acg ggc aag acc aac ate gcg gaa gee ate gec cac gec gtg ccc 1056 
Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
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340 345 350 

ttc tac ggc tgc gtc aac tgg acc aat gag aac ttt ccc ttc aat gat 1104 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

tgc gtc gac aag atg gtg ate tgg tgg gag gag ggc aag atg acg gec 1152 

Cys Val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

aag gtc gtg gag tec gec aag gec att etc ggc ggc age aag gtg cgc 1200 

Lys Val Val Glu Ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 

385 390 395 400 

gtg gac caa aag tgc aag teg tec gec cag ate gac ccc acc ccc gtg 1248 

Val Asp Gin Lys Cys Lys Ser Ser Ala Gin lie Asp Pro Thr Pro Val 
405 410 415 

ate gtc acc tec aac acc aac atg tgc gee gtg att gac ggg aac age 1296 

lie Val Thr Ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn Ser 
420 425 430 

acc acc ttc gag cac cag cag ccg ttg cag gac egg atg ttc aaa ttt 1344 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

gaa etc acc cgc cgt ctg gag cat gac ttt ggc aag gtg aca aag cag 1392 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 

gaa gtc aaa gag ttc ttc cgc tgg gcg cag gat cac gtg acc gag gtg 1440 

Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu Val 

465 470 475 480 

gcg cat gag ttc tac gtc aga aag ggt gga gee aac aaa aga ccc gec 1488 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 
485 490 495 

ccc gat gac gcg gat aaa age gag ccc aag egg gee tgc ccc tea gtc 1536 

Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 510 

gcg gat cca teg acg tea gac gcg gaa gga get ccg gtg gac ttt gec 1584 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 

515 520 525 



gac agg tat ggc tgc cga tgg tta tct tec aga ttg get cga gga caa 
Asp Arg Tyr Gly Cys Arg Trp Leu Ser Ser Arg Leu Ala Arg Gly Gin 
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cct etc tga 1641 

Pro Leu 

545 



<210> 7 

<211> 546 

<212> PRT 

<213> AAV-1 

<400> 7 

Met Pro Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 
15 10 15 

Glu His Leu Pro Gly He Ser Asp Ser Phe Val Ser Trp Val Ala Glu 
20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 
35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 

Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His He Leu Val Glu 
85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He 
100 105 110 

Arg Asp Lys Leu Val Gin Thr He Tyr Arg Gly He Glu Pro Thr Leu 
115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin' Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 
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Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 

Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 
245 250 255 

Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 

He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 
275 280 285 

Pro Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 

Asn Gly Tyr Glu Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Arg Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 ■ 335 

Thr Thr Gly Lys Thr Asn He Ala Glu Ala lie Ala His Ala Val Pro 
340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 

Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 

Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
405 410 415 

He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 
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Glu Leu Thr Arg Arg 
450 

Glu Val Lys Glu Phe 
465 

Ala His Glu Phe Tyr 
485 

Pro Asp Asp Ala Asp 
500 

Ala Asp Pro Ser Thr 
515 

Asp Arg Tyr Gly Cys 
530 

Pro Leu 
545 



Leu Glu His Asp Phe Gly 
455 

Phe Arg Trp Ala Gin Asp 
470 475 

Val Arg Lys Gly Gly Ala 
490 

Lys Ser Glu Pro Lys Arg 
505 

Ser Asp Ala Glu Gly Ala 
520 

Arg Trp Leu Ser Ser Arg 
535 



Lys Val Thr Lys Gin 
460 

His Val Thr Glu Val 
480 

Asn Lys Arg Pro Ala 
495 

Ala Cys Pro Ser Val 
510 

Pro Val Asp Phe Ala 
525 

Leu Ala Arg Gly Gin 
540 



<210> 8 
<211> 1200 
<212> DNA 
<213> AAV-1 

<220> 

<221> CDS 

<222> (1) . . (11S7) 

<400> 8 

atg gag ctg gtc ggg tgg ctg gtg gac 

Met Glu Leu Val Gly Trp Leu Val Asp 
1 5 

cag tgg ate cag gag gac cag gec teg 
Gin Trp lie Gin Glu Asp Gin Ala Ser 
20 25 

tec aac teg egg tec cag ate aag gee 
Ser Asn Ser Arg Ser Gin lie Lys Ala 
35 40 

ate atg gcg ctg acc aaa tec gcg ccc 
lie Met Ala Leu Thr Lys Ser Ala Pro 
50 55 



egg ggc ate acc tec gag aag 48 
Arg Gly lie Thr Ser Glu Lys 
10 15 

tac ate tec ttc aac gec get 96 
Tyr lie Ser Phe Asn Ala Ala 
30 

get ctg gac aat gec ggc aag 144 
Ala Leu Asp Asn Ala Gly Lys 
45 

gac tac ctg gta ggc ccc get 192 
Asp Tyr Leu Val Gly Pro Ala 
60 
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ccg ccc gcg gac att aaa acc aac cgc ate tac cgc ate ctg gag ctg 240 
Pro Pro Ala Asp lie Lys Thr Asn Arg lie Tyr Arg lie Leu Glu Leu 
65 70 75 4 80 

aac ggc tac gaa cct gec tac gec ggc tec gtc ttt etc ggc tgg gec 288 
Asn Gly Tyr Glu Pro Ala Tyr Ala Gly ser Val Phe Leu Gly Trp Ala 
85 SO 95 

cag aaa agg ttc ggg aag cgc aac acc ate tgg ctg ttt ggg ccg gee 336 
Gin Lys Arg Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
100 105 110 

acc acg ggc aag acc aac ate gcg gaa gec ate gec cac gec gtg ccc 384 
Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala Val Pro 
115 120 125 

ttc tac ggc tgc gtc aac tgg acc aat gag aac ttt ccc ttc aat gat 432 
Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

tgc gtc gac aag atg gtg ate tgg tgg gag gag ggc aag atg acg gee 480 
Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

aag gtc gtg gag tec gec aag gee att etc ggc ggc age aag gtg cgc 528 
Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
165 170 175 

gtg gac caa aag tgc aag teg tec gec cag ate gac ccc acc ccc gtg 576 
Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
180 185 190 

ate gtc acc tec aac acc aac atg tgc gec gtg att gac ggg aac age 624 
He Val Thr Ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn Ser 
195 200 205 

acc acc ttc gag -cac cag cag ccg ttg cag gac egg atg ttc aaa ttt 672 
Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
210 215 220 

gaa etc acc cgc cgt ctg gag cat gac ttt ggc aag gtg aca aag cag 720 
Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
225 230 235 240 

gaa gtc aaa gag ttc ttc cgc tgg gcg cag gat cac gtg acc gag gtg 768 
Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu Val 
245 250 255 
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gcg cat gag ttc tac gtc aga aag ggt gga gcc aac aaa aga ccc gcc 816 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 
260 265 270 

ccc gat gac gcg gat aaa age gag ccc aag egg gcc tgc ccc tea gtc 864 

Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

275 280 285 

gcg gat cca teg acg tea gac gcg gaa gga get ccg gtg gac ttt gcc 912 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 

290 295 300 

gac agg tac caa aac aaa tgt tct cgt cac gcg ggc atg ctt cag atg 960 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu Gin Met 
305 310 315 3'20 

ctg ttt ccc tgc aag aca tgc gag aga atg aat cag aat ttc aac att 1008 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn lie 
325 330 335 

tgc ttc acg cac ggg acg aga gac tgt tea gag tgc ttc ccc ggc gtg 1056 

Cys Phe Thr His Gly Thr Arg Asp Cys Ser Glu Cys Phe Pro Gly Val 
340 345 350 

tea gaa tct caa ccg gtc gtc aga aag agg acg tat egg aaa etc tgt 1104 

Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys Leu Cys 

355 360 365 

gcc att cat cat ctg ctg ggg egg get ccc gag att get tgc teg gcc 1152 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys Ser Ala 

370 375 380 



tgc gat ctg gtc aac gtg gac ctg gat gac tgt gtt tct gag caa taa 
Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
385 390 3S5 



1200 



<210> 9 
<211> 399 
<212> PRT 
<213> AAV-1 



<400> 9 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 

15 10 15 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 
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30 



Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 



He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 
50 55 60 



Pro Pro Ala Asp 
65 

Asn Gly Tyr Glu 



Gin Lys Arg Phe 
100 

Thr Thr Gly Lys 
115 

Phe Tyr Gly Cys 
130 

Cys Val Asp Lys 
145 

Lys Val Val Glu 



Val Asp Gin Lys 
180 

He Val Thr Ser 
195 



He Lys Thr Asn 
70 

Pro Ala Tyr Ala 
85 

Gly Lys Arg Asn 



Thr Asn He Ala 
120 

Val Asn Trp Thr 
135 

Met Val He Trp 
150 

Ser Ala Lys Ala 
165 

Cys Lys Ser Ser 



Asn Thr Asn Met 
200 



Arg He Tyr Arg 
75 

Gly Ser Val Phe 
90 

Thr He Trp Leu 
105 

Glu Ala He Ala 



Asn Glu Asn Phe 
140 

Trp Glu Glu Gly 
155 

He Leu Gly Gly 
170 

Ala Gin He Asp 
185 

Cys Ala Val He 



He Leu Glu Leu 
80 

Leu Gly Trp Ala 
95 

Phe Gly Pro Ala 
110 

His Ala Val Pro 
125 

Pro Phe Asn Asp 



Lys Met Thr Ala 
160 

Ser Lys Val Arg 
175 

Pro Thr Pro Val 
190 

Asp Gly Asn Ser 
205 



Thr Thr Phe Glu His Gin Gin Pro 
210 215 

Glu Leu Thr Arg Arg Leu Glu His 
225 230 

Glu Val Lys Glu Phe Phe Arg Trp 
245 

Ala His Glu Phe Tyr Val Arg Lys 
260 

Pro Asp Asp Ala Asp Lys Ser Glu 



Leu Gin Asp Arg Met Phe Lys Phe 
220 

Asp Phe Gly Lys Val Thr Lys Gin 
235 240 

Ala Gin Asp His Val Thr Glu Val 
250 255 

Gly Gly Ala Asn Lys Arg Pro Ala 
265 270 

Pro Lys Arg Ala Cys Pro Ser Val 
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275 230 285 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
290 295 300 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu Gin Met 
305 310 315 320 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn lie 
325 330 335 

Cys Phe Thr His Gly Thr Arg Asp Cys Ser Glu Cys Phe Pro Gly Val 
340 345 350 

Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys Leu Cys 
355 360 365 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys Ser Ala 
370 375 380 

Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
385 390 395 



<210> 10 
<211> 969 
<212> DNA 
<213> AAV-1 

<220> 
<221> CDS 
<222> (1) . . (966) 

<220> 

<221> misc_f eature 
<222> (943) . . (944) 
<223> minor splice site 

<400> 10 

atg gag ctg gtc ggg tgg ctg gtg gac egg ggc ate ace tec gag aag 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 

15 10 15 

cag tgg ate cag gag gac cag gec teg tac ate tec ttc aac gec get 96 
Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 
20 25 30 

tec aac teg egg tec cag ate aag gec get ctg gac aat gee ggc aag 144 



48 
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Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 

ate atg gcg ctg acc aaa tec gcg ccc gac tac ctg gta ggc ccc 'get 192 
lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 
50 55 60 

ccg ccc gcg gac att aaa acc aac cgc ate tac cgc ate ctg gag ctg 240 
Pro Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
65 70 75 80 

aac ggc tac gaa cct gec tac gec ggc tec gtc ttt etc ggc tgg gec 288 
Asn Gly Tyr Glu Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
85 90 95 

cag aaa agg ttc ggg aag cgc aac acc ate tgg ctg ttt ggg ccg gec 336 
Gin Lys Arg Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
100 105 110 

acc acg ggc aag acc aac ate gcg gaa gee ate gec cac gee gtg ccc 384 
Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
115 120 125 

ttc tac ggc tgc gee aac tgg acc aat gag aac ttt ccc ttc aat gat 432 
Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

tgc gtc gac aag atg gtg ate tgg tgg gag gag ggc aag atg acg gec 480 
Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

aag gtc gtg gag tec gee aag gee att etc ggc ggc age aag gtg cgc 528 
Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
165 170 175 

gtg gac caa aag tgc aag teg tec gec cag ate gac ccc acc ccc gtg 576 
Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
180^ 185 190 

ate gtc acc tec aac acc aac atg tgc gee gtg att gac ggg aac age 624 
He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
195 200 205 

acc acc ttc gag cac cag cag ccg ttg cag gac egg atg ttc aaa ttt 672 
Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
210 215 220 

gaa etc acc cgc cgt ctg gag cat gac ttt ggc aag gtg aca aag cag 720 
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Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

225 . 230 235 240 

gaa gtc aaa gag ttc ttc cgc tgg gcg cag gat cac gtg acc gag' gtg 768 

Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu Val 

245 250 255 

gcg cat gag ttc tac gtc aga aag ggt gga gcc aac aaa aga ccc gcc 816 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 

260 265 270 

ccc gat gac gcg gat aaa age gag ccc aag egg gcc tgc ccc tea gtc 864 

Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

275 280 285 

gcg gat cca teg acg tea gac gcg gaa gga get ccg gtg gac ttt gcc 912 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 

290 295 300 

gac agg tat ggc tgc cga tgg tta tct tec aga ttg get cga gga caa 960 

Asp Arg Tyr Gly Cys Arg Trp Leu Ser Ser Arg Leu Ala Arg Gly Gin 

305 310 315 320 

cct etc tga 969 
Pro Leu 



<210> 11 
<211> 322 
<212> PRT 
<213> AAV-1 

<400> 11 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
15 10 15 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 
20 ^ 25 30 

Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 
35 40 45 



lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 

50 55 60 

Pro Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 

65 70 75 80 
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As ft Gly Tyr Glu Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
85 90 95 

Gin Lys Arg Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly ProAla 
100 105 110 

Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
115 120 125 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
130 135 140 

Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val Val Glu Ser Ala Lys Ala He Leu ,Gly Gly Ser Lys Val Arg 
165 170 175 

Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
180 185 190 

He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
210 215 220 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
225 230 235 240 

Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu Val 
245 250 255 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 
260 265 270 

Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
275 " 280 285 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
290 295 300 

Asp Arg Tyr Gly Cys Arg Trp Leu Ser Ser Arg Leu Ala Arg Gly Gin 
305 310 315 320 

Pro Leu 
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<210> 12 

<211> 2211 

<212> DNA 

<213> AAV-1 



<220> 

<221> CDS 

<222> (1) . . (2208) 

<400> 12 

atg get gec gat ggt tat ctt cca gat tgg etc gag gac aac etc tct 48 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



gag ggc att cgc gag tgg tgg gac ttg aaa cct gga gee ccg aag ccc 96 
Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



aaa gee aac cag caa aag cag gac gac ggc egg ggt ctg gtg ctt cct 144 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 

35 40 45 

ggc tac aag tac etc gga ccc ttc aac gga etc gac aag ggg gag ccc 192 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 



gtc aac gcg gcg gac gca gcg gec etc gag cac gac aag gee tac gac 240 
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



cag cag etc aaa gcg ggt gac aat ccg tac ctg egg tat aac cac gee 288 
Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

gac gec gag ttt cag gag cgt ctg caa gaa gat acg tct ttt ggg ggc 336 
Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



aac etc ggg cga gca gtc ttc cag gec aag aag egg gtt etc gaa cct 
Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



384 



etc ggt ctg gtt gag gaa ggc get aag acg get cct gga aag aaa cgt 
Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys' Arg 
130 135 140 



432 



ccg gta gag cag teg cca caa gag cca gac tec tec teg ggc ate ggc 480 
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Pro-Val Glu 
145 

aag aca ggc 
Lys Thr Gly 



ggc gac tea 
Gly Asp Ser 



gca acc ccc 
Ala Thr Pro 
195 

gca cca atg 
Ala Pro Met 
210 

tea gga aat 
Ser Gly Asn 
225 

acc acc age 
Thr Thr Ser 



tac aag caa 
Tyr Lys Gin 

tac ttc ggc 
Tyr Phe Gly 
275 

cac tgc cac 
His Cys His 
290 

tgg gga ttc 
Trp Gly Phe 
305 

gtc aag gag 
Val Lys Glu 



ctt acc age 



Gin Ser Pro 
150 

cag cag ccc 
Gin Gin ' Pro 
165 

gag tea gtc 
Glu Ser Val 
180 

get get gtg 
Ala Ala Val 



gca gac aat 
Ala Asp Asn 



tgg cat tgc 
Trp His Cys 
230 

acc cgc acc 
Thr Arg Thr 
245 

ate tec age 
lie Ser Ser 
260 

tac age acc 
Tyr Ser Thr 

ttt tea cca 
Phe Ser Pro 



egg ccc aag 
Arg Pro Lys 
310 

gtc acg acg 
Val Thr Thr 
325 

acg gtt caa 



Gin Glu Pro 



get aaa aag 
Ala Lys Lys 



ccc gat cca 
Pro Asp Pro 
185 

gga cct act 
Gly Pro Thr 
200 

aac gaa ggc 
Asn Glu Gly 
215 

gat tec aca 
Asp Ser Thr 



tgg gec ttg 
Trp Ala Leu 



get tea acg 
Ala Ser Thr 
265 

ccc tgg ggg 
Pro Trp Gly 
280 

cgt gac tgg 
Arg Asp Trp 
295 

aga etc aac 
Arg Leu Asn 



aat gat ggc 
Asn Asp Gly 



gtc ttc teg 



Asp Ser Ser 
155 

aga etc aat 
Arg Leu Asn 
170 

caa cct etc 
Gin Pro Leu 



aca atg get 
Thr Met Ala 



gec gac gga 
Ala Asp Gly 
220 

tgg ctg ggc 
Trp Leu Gly 
235 

ccc acc tac 
Pro Thr Tyr 
250 

ggg gee age 
Gly Ala Ser 

tat ttt gat 
Tyr Phe Asp 

cag cga etc 
Gin Arg Leu 
300 

ttc aaa etc 
Phe Lys Leu 
315 

gtc aca acc 
Val Thr Thr 
330 

gac teg gag 



Ser Gly He 



ttt ggt cag 
Phe Gly Gin 
175 

gga gaa cct 
Gly Glu Pro 
190 

tea ggc ggt 
Ser Gly Gly 
205 

gtg ggt aat 
Val Gly Asn 



gac aga gtc 
Asp Arg Val 

aat aac cac 
Asn Asn His 
255 

aac gac aac 
Asn Asp Asn 
270 

ttc aac aga 
Phe Asn Arg 
285 

ate aac aac 
He Asn Asn 



ttc aac ate 
Phe Asn He 



ate get aat 
He Ala Asn 
335 

tac cag ctt 



Gly 
160 

act 528 
Thr 



cca 576 
Pro 



ggc 624 
Gly 



gec 672 
Ala 



ate 720 

He 

240 

etc 768 
Leu 



cac 816 
His 



ttc 864 
Phe 



aat 912 
Asn 



caa 960 

Gin 

320 

aac 1008 
Asn 



ccg 1056 
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Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 

tac gtc etc ggc tct gcg cac cag ggc tgc etc cct ccg ttc ccg 'gcg 1104 

Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 

gac gtg ttc atg att ccg caa tac ggc tac ctg acg etc aac aat ggc 1152 

Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 



age caa gec gtg gga cgt tea tec ttt tac tgc ctg gaa tat ttc cct 
Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 3S5 400 



1200 



tct cag atg ctg aga acg ggc aac aac ttt acc ttc age tac ace ttt 1248 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 

gag gaa gtg cct ttc cac age age tac gcg cac age cag age ctg gac 1296 

Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 

420 425 430 

egg ctg atg aat cct etc ate gac caa tac ctg tat tac ctg aac aga 1344 

Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

act caa aat cag tec gga agt gee caa aac aag gac ttg ctg ttt age 1392 

Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 

450 455 460 

cgt ggg tct cca get ggc atg tct gtt cag ccc aaa aac tgg eta cct 1440 

Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 

465 470 475 480 

gga ccc tgt tat egg cag cag cgc gtt tct aaa aca aaa aca gac aac 1488 

Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
435 490 495 

aac aac age aat ttt acc tgg act ggt get tea aaa tat aac etc aat 1536 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 510 

ggg cgt gaa tec ate ate aac cct ggc act get atg gee tea cac aaa 1584 

Gly Arg Glu Ser He He Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

gac gac gaa gac aag ttc ttt ccc atg age ggt gtc atg att ttt gga 1632 



36 



WO 00/28061 



PCT/US99/25694 



Asp Asp Glu 
530 

aaa gag age 

Lys Glu Ser 
545 

aca gac gaa 

Thr Asp Glu 



ttt ggg acc 
Phe Gly Thr 



acc gga gat 
Thr Gly Asp 
595 

gat aga gac 
Asp Arg Asp 
610 

aca gat gga 
Thr Asp Gly 
625 

aag aac ccg 
Lys Asn Pro 



aat cct ccg 
Asn Pro Pro 



caa tac tec 
Gin Tyr Ser 
675 

aaa gaa aac 
Lys Glu Asn 
690 

tat gca aaa 
Tyr Ala Lys 
705 

tat act gag 



Asp Lys Phe 



gec gga get 
Ala Gly Ala 
550 

gag gaa att 
Glu Glu lie 
565 

gtg gca gtc 
Val Ala Val 
580 

gtg cat get 
Val His Ala 



gtg tac ctg 
Val Tyr Leu 



cac ttt cac 
His Phe His 
630 

cct cct cag 
Pro Pro Gin 
645 

gcg gag ttt 
Ala Glu Phe 
660 

aca gga caa 
Thr Gly Gin 



age aag cgc 
Ser Lys Arg 



tct gec aac 
Ser Ala Asn 
710 

cct cgc ccc 



Phe Pro Met 
535 

tea aac act 
Ser Asn Thr 



aaa gee act 
Lys Ala Thr 



aat ttc cag 
Asn Phe Gin 
585 

atg gga gca 
Met Gly Ala 
600 

cag ggt ccc 
Gin Gly Pro 
615 

ccg tct cct 
Pro Ser Pro 



ate etc ate 
He Leu He 



tea get aca 
Ser Ala Thr 
665 

gtg agt gtg 
Val Ser Val 
680 

tgg aat ccc 
Trp Asn Pro 
695 

gtt gat ttt 
Val Asp Phe 



att ggc acc 



Ser Gly Val 
540 

gca ttg gac 
Ala Leu Asp 
555 

aac cct gtg 
Asn Pro Val 
570 

age age age 
Ser Ser Ser 



tta cct ggc 
Leu Pro Gly 



att tgg gee 
He Trp Ala 
620 

ct t atg ggc 
Leu Met Gly 
635 

aaa aac acg 
Lys Asn Thr 
650 

aag ttt get 
Lys Phe Ala 



gaa att gaa 
Glu He Glu 



gaa gtg cag 
Glu Val Gin 
700 

act gtg gac 
Thr Val Asp 
715 

cgt tac ctt 



Met He Phe 



aat gtc atg 
Asn Val Met 



gec acc gaa 
Ala Thr Glu 
575 

aca gac cct 
Thr Asp Pro 
590 

atg gtg tgg 
Met Val Trp 
605 

aaa att cct 
Lys He Pro 

ggc ttt gga 
Gly Phe Gly 

cct gtt cct 
Pro Val Pro 
655 

tea ttc ate 
Ser Phe He 
670 

tgg gag ctg 
Trp Glu Leu 
685 

tac aca tec 
Tyr Thr Ser 



aac aat gga 
Asn Asn Gly 



acc cgt ccc 



Gly 



att 1680 

He 

560 

aga 1728 
Arg 



gcg 1776 
Ala 



caa 1824 
Gin 



cac 1872 
His 

etc 1920 

Leu 

640 

gcg 1968 
Ala 



acc 2016 
Thr 



cag 2064 
Gin 



aat 2112 
Asn 



ctt 2160 

Leu 

720 

ctg 2208 
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Tyr. Thr Glu Pro Arg Pre lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

taa ' 2211 



<210> 13 
<211> 736 
<212> PRT 
<213> AAV-1 



<400> 13 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 . 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 SO 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys -Arg 
130 * 135 140 

Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 135 190 
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Ala. Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu 
210 215 

Ser Gly Asn Trp His Cys Asp Ser 
225 230 

Thr Thr Ser Thr Arg Thr Trp Ala 
245 

Tyr Lys Gin He Ser Ser Ala Ser 
260 

Tyr Phe Gly Tyr Ser Thr Pro Trp 
275 280 



Gly Ala Asp Gly Val Gly Asn 'Ala 
220 

Thr Trp Leu Gly Asp Arg Val He 
235 240 

Leu Pro Thr Tyr Asn Asn His Leu 
250 255 

Thr Gly Ala Ser Asn Asp Asn His 
265 270 

Gly Tyr Phe Asp Phe Asn Arg Phe 
285 



His Cys His Phe 
290 

Trp Gly Phe Arg 
305 

Val Lys Glu Val 



Leu Thr Ser Thr 
340 

Tyr Val Leu Gly 
355 

Asp Val Phe Met 
370 

Ser Gin Ala Val 
385 

Ser Gin Met Leu 



Glu Glu Val Pro 
420 

Arg Leu Met Asn 
435 



Ser Pro Arg Asp 
295 

Pro Lys Arg Leu 
310 

Thr Thr Asn Asp 
325 

Val Gin Val Phe 



Ser Ala His Gin 
360 

He Pro Gin Tyr 
375 

Gly Arg Ser Ser 
390 

Arg Thr Gly Asn 
405 

Phe His Ser Ser 



Pro Leu lie Asp 
440 



Trp Gin Arg Leu 
300 

Asn Phe Lys Leu 
315 

Gly Val Thr Thr 
330 

Ser Asp Ser Glu 
345 

Gly Cys Leu Pro 



Gly Tyr Leu Thr 
380 

Phe Tyr Cys Leu 
395 

Asn Phe Thr Phe 
410 

Tyr Ala His Ser 
425 

Gin Tyr Leu Tyr 



He Asn Asn Asn 



Phe Asn He Gin 

.320 

He Ala Asn Asn 
335 

Tyr Gin Leu Pro 
350 

Pro Phe Pro Ala 
365 

Leu Asn Asn Gly 



Glu Tyr Phe Pro 
400 

Ser Tyr Thr Phe 
415 

Gin Ser Leu Asp 
430 

Tyr Leu Asn Arg 
445 



39 



WO 00/28061 



PCT/US99/25694 



Thr_Gln Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 

Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 

Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly 
530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met He 
545 550 555 560 

Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 570 575 

Phe Gly Thr Val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 

Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 

Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys. Asn Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe He Thr 
660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 6S0 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 6S5 700 
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Tyr _Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 

705 710 715 720 

Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro teu 

725 730 735 



<210> 14 
<211> 1800 
<212> DNA 
<213> AAV-1 

<220> 

<221> CDS 

<222> (1) . . (17S7) 

<400> 14 

acg get cct gga aag aaa cgt ccg gta gag cag teg cca caa gag cca 48 

Thr Ala Pro Gly Lys Lys Arg Pro Val Glu Gin Ser Pro Gin Glu Pro 
15 10 15 

gac tec tec teg ggc ate ggc aag aca ggc cag cag ccc get aaa aag 96 
Asp Ser Ser Ser Gly lie Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys 
20 25 30 

aga etc aat ttt ggt cag act ggc gac tea gag tea gtc ccc gat cca 144 
Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
35 40 45 

caa cct etc gga gaa cct cca gca ace ccc get get gtg gga cct act 192 
Gin Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
50 55 60 

aca atg get tea ggc ggt ggc gca cca atg gca gac aat aac gaa ggc 240 
Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65 70 75 80 

gec gac gga gtg ggt aat gee tea gga aat tgg cat tgc gat tec aca 288 
Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
85 SO 95 

tgg ctg ggc gac aga gtc ate acc acc age acc cgc ace tgg gec ttg 336 
Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr Arg Thr Trp Ala Leu 
100 105 110 

ccc acc tac aat aac cac etc tac aag caa ate tec agt get tea acg 384 
Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie Ser Ser Ala Ser Thr 
115 120 125 
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ggg gcc age aac gac aac cac tac ttc ggc tac age ace ccc tgg ggg 432 

Gly Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly 

130 135 140 

tat ttt gat ttc aac aga ttc cac tgc cac ttt tea cca cgt gac tgg 480 

Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp 

145 150 155 160 

cag cga etc ate aac aac aat tgg gga ttc egg ccc aag aga etc aac 528 

Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn 
165 170 175 

ttc aaa etc ttc aac ate caa gtc aag gag gtc acg acg aat gat ggc 576 

Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr Thr Asn Asp Gly 
180 185 190 

gtc aca ace ate get aat aac ctt acc age acg gtt caa gtc ttc teg 624 

Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Ser 
155 200 205 

gac teg gag tac cag ctt ccg tac gtc etc ggc tct gcg cac cag ggc 672 

Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly 

210 215 220 

tgc etc cct ccg ttc ccg gcg gac gtg ttc atg att ccg caa tac ggc 720 

Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly 

225 230 . 235 240 

tac ctg acg etc aac aat ggc age caa gcc gtg gga cgt tea tec ttt 768 

Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe 
245 250 255 

tac tgc ctg gaa tat ttc cct tct cag atg ctg aga acg ggc aac aac 816 

Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn 
260 265 270 

ttt acc ttc age tac acc ttt gag gaa gtg cct ttc cac age age tac 864 

Phe Thr Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe His Ser Ser Tyr 
275 280 285 

gcg cac age cag age ctg gac egg ctg atg aat cct etc ate gac caa 912 

Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin 

290 295 300 

tac ctg tat tac ctg aac aga act caa aat cag tec gga agt gcc caa 960 

Tyr Leu Tyr Tyr Leu Asn Arg Thr Gin Asn Gin Ser Gly Ser Ala Gin 

305 310 315 320 
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aac aag gac 
Asn Lys Asp 



cag ccc aaa 
Gin Pro Lys 



tct aaa aca 
Ser Lys Thr 
355 

get tea aaa 
Ala Ser Lys 
370 

act get atg 
Thr Ala Met 
385 

age ggt gtc 
Ser Gly Val 



gca ttg gac 
Ala Leu Asp 



aac cct gtg 
Asn Pro Val 
435 

age age age 
Ser Ser Ser 
450 

tta cct ggc 
Leu Pro Gly 
465 

att tgg gee 
lie Trp Ala 



ctt atg ggc 
Leu Met Gly 



ttg ctg ttt 
Leu Leu Phe 
325 

aac tgg eta 
Asn Trp Leu 
340 

aaa aca gac 
Lys Thr Asp 

tat aac etc 
Tyr Asn Leu 



gec tea cac 
Ala Ser His 
390 

atg att ttt 
Met lie Phe 
405 

aat gtc atg 
Asn Val Met 
420 

gee ace gaa 
Ala Thr Glu 



aca gac cct 
Thr Asp Pro 

atg gtg tgg 
Met Val Trp 
470 

aaa att cct 
Lys He Pro 
485 

ggc ttt gga 
Gly Phe Gly 
500 



age cgc ggg 
Ser Arg Gly 

cct gga ccc 
Pro Gly Pro 
345 

aac aac aac 
Asn Asn Asn 
360 

aat ggg cgt 
Asn Gly Arg 
375 

aaa gac gac 
Lys Asp Asp 



gga aaa gag 
Gly Lys Glu 



att aca gac 
He Thr Asp 
425 

aga ttt ggg 
Arg Phe Gly 
440 

gcg acc gga 
Ala Thr Gly 
455 

caa gat aga 
Gin Asp Arg 



cac aca gat 
His Thr Asp 



etc aag aac 
Leu Lys Asn 
505 



tct cca get 

Ser Pro Ala 
330 

tgt tat egg 

Cys Tyr Arg 



age aat ttt 
Ser Asn Phe 



gaa tec ate 
Glu Ser He 
380 

gaa gac aag 
Glu Asp Lys 
395 

age gee gga 
Ser Ala Gly 
410 ' 

gaa gag gaa 
Glu Glu Glu 



acc gtg gca 
Thr Val Ala 



gat gtg cat 
Asp Val His 
460 

gac gtg tac 
Asp Val Tyr 
475 

gga cac ttt 
Gly His Phe 
490 

ccg cct cct 
Pro Pro Pro 



ggc atg tct 
Gly Met Ser 
335 

cag cag cgc 
Gin Gin Arg 
350 

acc tgg act 
Thr Trp Thr 
365 

ate aac cct 
He Asn Pro 



ttc ttt ccc 
Phe Phe Pro 

get tea aac 
Ala Ser Asn 
415 

att aaa gee 
He Lys Ala 
430 

gtc aat ttc 

Val Asn Phe 
445 

get atg gga 

Ala Met Gly 



ctg cag ggt 
Leu Gin Gly 



cac ccg tct 
His Pro Ser 
495 

cag ate etc 
Gin He Leu 
510 



gtt 1008 
Val 



gtt 1056 
Val 



ggt 1104 
Gly 



ggc 1152 
Gly 



atg 1200 

Met 

400 

act 1248 
Thr 



act 1 1296 
Thr 



cag 1344 
Gin 



gca 1392 
Ala 



ccc 1440 

Pro 

480 

cct 1488 
Pro 



ate 1536 
He 
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aaa aac acg cct gtt cct gcg aat cct ccg gcg gag ttt tea get aca 1584 
Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr 
515 520 525 

aag ttt get tea ttc ate ace caa tac tec aca gga caa gtg agt gtg 1632 
Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val 
530 535 540 

gaa att gaa tgg gag ctg cag aaa gaa aac age aag cgc tgg aat ccc 1680 
Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545 550 555 560 

gaa gtg cag tac aca tec aat tat gca aaa tct gec aac gtt gat ttt 1728 
Glu Val Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
565 570 575 

act gtg gac aac aac gga ctt tac act gag cct cgc ccc att ggc ace 1776 
Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro lie Gly Thr 
580 585 590 

cgt tac ctt acc cgt ccc ctg taa 1800 
Arg Tyr Leu Thr Arg Pro Leu 
595 



<210> 15 
<211> 599 
<212> PRT 
<213> AAV-1 

<400> 15 

Thr Ala Pro Gly Lys Lys Arg Pro Val Glu Gin Ser Pro Gin Glu Pro 
15 10 15 

Asp Ser Ser Ser Gly He Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys 
20 25 30 

Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu Ser Val Pro Asp Pro 
35 40' 45 

Gin Pro Leu Gly Glu Pro Pro Ala Thr Pro Ala Ala Val Gly Pro Thr 
50 55 60 

Thr Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly 
65 70 75 80 

Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr 
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95 



Trp Leu Gly Asp 
100 

Pro Thr Tyr Asn 
115 

Gly Ala Ser Asn 
130 

Tyr Phe Asp Phe 
145 

Gin Arg Leu lie 



Phe Lys Leu Phe 
180 

Val Thr Thr lie 
195 

Asp Ser Glu Tyr 
210 

Cys Leu Pro Pro 
225 

Tyr Leu Thr Leu 



Tyr Cys Leu Glu 
260 

Phe Thr Phe Ser 
275 

Ala His Ser Gin 
290 

Tyr Leu Tyr Tyr 
305 

Asn Lys Asp Leu 



Gin Pro Lys Asn 



Arg Val He Thr 



Asn His Leu Tyr 
120 

Asp Asn His Tyr 
135 

Asn Arg Phe His 
150 

Asn Asn Asn Trp 
165 

Asn He Gin Val 



Ala Asn Asn Leu 
200 

Gin Leu Pro Tyr 
215 

Phe Pro Ala Asp 
230 

Asn Asn Gly Ser 
245 

Tyr Phe Pro Ser 



Tyr Thr Phe Glu 
280 

Ser Leu Asp Arg 
295 

Leu Asn Arg Thr 
310 

Leu Phe Ser Arg 
325 

Trp Leu Pro Gly 



Thr Ser Thr Arg 
105 

Lys Gin He Ser 



Phe Gly Tyr Ser 
140 

Cys His Phe Ser 
155 

Gly Phe Arg Pro 
170 

Lys Glu Val Thr 
185 

Thr Ser Thr Val 



Val Leu Gly Ser 
220 

Val Phe Met He 
235 

Gin Ala Val Gly 
250 

Gin Met Leu Arg 
265 

Glu Val Pro Phe 



Leu Met Asn Pro 
300 

Gin Asn Gin Ser 
315 

Gly Ser Pro Ala 
330 

Pro Cys Tyr Arg 



Thr Trp Ala Leu 
110 

Ser Ala Ser Thr 
125 

Thr Pro Trp Gly 



Pro Arg Asp Trp 
160 

Lys Arg Leu Asn 
175 

Thr Asn Asp Gly 
190 

Gin Val Phe Ser 
205 

Ala His Gin Gly 



Pro Gin Tyr Gly 
240 

Arg Ser Ser Phe 
255 

Thr Gly Asn Asn 
270 

His Ser Ser Tyr 
285 

Leu He Asp Gin 



Gly Ser Ala Gin 
320 

Gly Met Ser Val 
335 

Gin Gin Arg Val 
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340 345 350 

Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser Asn Phe Thr Trp Thr Gly 
355 360 365 

Ala Ser Lys Tyr Asn Leu Asn Gly Arg Glu Ser lie lie Asn Pro Gly 
370 375 380 

Thr Ala Met Ala Ser His Lys Asp Asp Glu Asp Lys Phe Phe Pro Met 
385 3S0 395 400 

Ser Gly Val Met lie Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr 
405 410 415 

Ala Leu Asp Asn Val Met He Thr Asp Glu Glu Glu He Lys Ala Thr 
420 425 430 

Asn Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Phe Gin 
435 440 445 

Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Ala Met Gly Ala 
450 455 460 

Leu Pro Gly Met Val Trp Gin Asp Arg Asp Val Tyr Leu Gin Gly Pro 
465 470 475 480 

He Trp Ala Lys lie Pro His Thr Asp Gly His Phe His Pro Ser Pro 
485 490 495 

Leu Met Gly Gly Phe Gly Leu Lys Asn Pro Pro Pro Gin He Leu He 
500 505 510 

Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr 
515 520 525 

Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val 
530 535 . 540 

Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro 
545 550 555 560 

Glu Val Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe 
565 570 575 

Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro He Gly Thr 
580 585 590 

Arg Tyr Leu Thr Arg Pro Leu 
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<210> 16 
<211> 1605 
<212> DNA 
<213> AAV-1 

<220> 

<221> CDS 

<222> (1) (1602) 

<400> 16 

atg get tea ggc ggt ggc gca cca atg gca gac aat aac gaa ggc gee 48 

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
15 10 15 



gac gga gtg ggt aat gec tea gga aat tgg cat tgc gat tec aca tgg 96 

Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 

20 25 30 

ctg ggc gac aga gtc ate ace ace age acc cgc ace tgg gee ttg ccc 144 

Leu Gly Asp Arg Val lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 

35 40 45 

acc tac aat aac cac etc tac aag caa ate tec agt get tea acg ggg 192 

Thr Tyr Asn Asn His Leu Tyr Lys Gin He Ser Ser Ala Ser Thr Gly 
50 55 60 

gee age aac gac aac cac tac ttc ggc tac age acc ccc tgg ggg tat 240 

Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 

65 70 75 80 



ttt gat ttc aac aga ttc cac tgc cac ttt tea cca cgt gac tgg cag 288 
Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
85 90 95 

cga etc ate aac 'aac aat tgg gga ttc egg ccc aag aga etc aac ttc 336 
Arg Leu He Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
100 105 110 

aaa etc ttc aac ate caa gtc aag gag gtc acg acg aat gat ggc gtc 384 
Lys Leu Phe Asn He Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val 
115 120 125 

aca acc ate get aat aac ctt acc age acg gtt caa gtc ttc teg gac 432 
Thr Thr He Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Ser Asp 
130 135 140 
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teg gag tac 
Ser Glu Tyr 
145 

etc cct ccg 
Leu Pro Pro 



ctg acg etc 
Leu Thr Leu 



tgc ctg gaa 
Cys Leu Glu 
195 

acc ttc age 
Thr Phe Ser 
210 

cac age cag 
His Ser Gin 
225 

ctg tat tac 
Leu Tyr Tyr 



aag gac ttg 
Lys Asp Leu 

ccc aaa aac 
Pro Lys Asn 
275 

aaa aca aaa 
Lys Thr Lys 
290 

tea aaa tat 
Ser Lys Tyr 
305 

get atg gec 
Ala Met Ala 



cag ctt ccg 
Gin Leu Pro 
150 

ttc ccg gcg 
Phe Pro Ala 
165 

aac aat ggc 
Asn Asn Gly 
180 

tat ttc cct 
Tyr Phe Pro 



tac acc ttt 
Tyr Thr Phe 



age ctg gac 
Ser Leu Asp 
230 

ctg aac aga 
Leu Asn Arg 
245 

ctg ttt age 
Leu Phe Ser 
260 

tgg eta cct 
Trp Leu Pro 



aca gac aac 
Thr Asp Asn 

aac etc aat 
Asn Leu Asn 
310 

tea cac aaa 
Ser His Lys 
325 



tac gee etc 
Tyr Val Leu 



gac gtg ttc 
Asp Val Phe 



age caa gec 
Ser Gin Ala 
185 

tct cag atg 
Ser Gin Met 
200 

gag gaa gtg 
Glu Glu Val 
215 

egg ctg atg 
Arg Leu Met 



act caa aat 
Thr Gin Asn 



cgt ggg tct 
Arg Gly Ser 
265 

gga ccc tgt 
Gly Pro Cys 
280 

aac aac age 
Asn Asn Ser 
295 

ggg cgt gaa 
Gly Arg Glu 



gac gac gaa 
Asp Asp Glu 



ggc tct gcg 
Gly Ser Ala 
155 

atg att ccg 
Met lie Pro 
170 

gtg gga cgt 
Val Gly Arg 

ctg aga acg 
Leu Arg Thr 



cct ttc cac 
Pro Phe His 
220 

aat cct etc 
Asn Pro Leu 
235 

cag tec gga 
Gin Ser Gly 
250 

cca get ggc 
Pro Ala Gly 

tat egg cag 
Tyr Arg Gin 



aat ttt acc 
Asn Phe Thr 
300 

tec ate ate 
Ser He He 
315 

gac aag ttc 
Asp Lys Phe 
330 



cac cag ggc 
His Gin Gly 



caa tac ggc 
Gin Tyr Gly 
175 

tea tec ttt 
Ser Ser Phe 
190 

ggc aac aac 
Gly Asn Asn 
205 

age age tac 
Ser Ser Tyr 



ate gac caa 
lie Asp Gin 



agt gee caa 
Ser Ala Gin 
255 

atg tct gtt 
Met Ser Val 
270 

cag cgc gtt 
Gin Arg Val 
285 

tgg act ggt 
Trp Thr Gly 



aac cct ggc 
Asn Pro Gly 



ttt ccc atg 
Phe Pro Met 
335 



tgc 480 

Cys 

i60 

tac 528 
Tyr 



tac 576 
Tyr 



ttt 624 
Phe 



gcg 672 
Ala 



tac 720 

Tyr 

240 

aac 768 
Asn 



cag 816 
Gin 



tct 864 
Ser 



get 912 
Ala 



act 960 

Thr 

320 

age 1008 
Ser 
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ggt gtc atg att ttt gga aaa gag age gec gga get tea aac act gca 1056 
Gly Val Met lie Phe Gly Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala 
340 345 350 

ttg gac aat gtc atg att aca gac gaa gag gaa att aaa gec act aac 1104 
Leu Asp Asn Val Met lie Thr Asp Glu Glu Glu lie Lys Ala Thr Asn 
355 360 365 

cct gtg gee acc gaa aga ttt ggg acc gtg gca gtc aat ttc cag age 1152 
Pro Val Ala Thr Glu Arg Phe Gly Thr Val Ala Val Asn Phe Gin Ser 
370 375 380 

age age aca gac cct gcg acc gga gat gtg cat get atg gga gca tta 1200 
Ser Ser Thr Asp Pro Ala Thr Gly Asp Val His Ala Met Gly Ala Leu 
385 390 395 400 

cct ggc atg gtg tgg caa gat aga gac gtg tac ctg cag ggt ccc att 1248 
Pro Gly Met Val Trp Gin Asp Arg Asp Val Tyr Leu Gin Gly Pro lie 
405 410 415 

tgg gec aaa att cct cac aca gat gga cac ttt cac ccg tct cct ctt 1296 
Trp Ala Lys lie Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
420 425 430 

atg ggc ggc ttt gga etc aag aac ccg cct cct cag ate etc ate aaa 1344 
Met Gly Gly Phe Gly Leu Lys Asn Pro Pro Pro Gin lie Leu He Lys 
435 440 445 

aac acg cct gtt cct gcg aat cct ccg gcg gag ttt tea get aca aag 1392 
Asn Thr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
450 455 460 

ttt get tea ttc ate acc caa tac tec aca gga caa gtg agt gtg gaa 1440 
Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu 
465 470 475 480 

att gaa tgg gag <:tg . cag aaa gaa aac age aag cgc tgg aat ccc gaa 1488 
He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
485 490 495 

gtg cag tac aca tec aat tat gca aaa tct gee aac gtt gat ttt act 1536 
Val Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
500 505 510 

gtg gac aac aat gga ctt tat act gag cct cgc ccc att ggc acc cgt 1584 
Val Asp Asn Asn Gly Leu Tyr Thr Glu Pre Arg Pro He Gly Thr Arg 
515 520 525 



49 



WO 00/28061 



PCTAJS99/25694 



tac ctt acc cgt ccc ctg taa 1605 
Tyr Leu Thr Arg Pro Leu 
530 



<210> 17 
<211> 534 
<212> PRT 
<213> AAV-1 

<400> 17 

Met Ala Ser Gly Gly Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala 
15 10 15 

Asp Gly Val Gly Asn Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp 
20 25 30 

Leu Gly Asp Arg Val lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro 
35 40 45 

Thr Tyr Asn Asn His Leu Tyr Lys Gin lie Ser Ser Ala Ser Thr Gly 
50 55 60 

Ala Ser Asn Asp Asn His Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
65 70 75 80 

Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
85 90 95 

Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe 
100 105 110 

Lys Leu Phe Asn He Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val 
115 120 125 

Thr Thr He Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Ser Asp 
130 ^135 140 

Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys 
145 150 155 160 

Leu Pro Pro Phe Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr 
165 170 175 

Leu Thr Leu Asn Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr 
180 185 190 
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Cys_Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe 

195 200 205 

Thr Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala 

210 215 220 

His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr 

225 230 235 240 



Leu Tyr Tyr Leu Asn Arg Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn 

245 250 255 

Lys Asp Leu Leu Phe Ser Arg Gly Ser Pro Ala Gly Met Ser Val Gin 
260 265 270 



Pro Lys Asn Trp 
275 

Lys Thr Lys Thr 
290 

Ser Lys Tyr Asn 
305 

Ala Met Ala Ser 



Gly Val Met lie 
340 

Leu Asp Asn Val 
355 

Pro Val Ala Thr 
370 

Ser Ser Thr Asp 
385 

Pro Gly Met Val 



Leu Pro Gly Pro 
280 

Asp Asn Asn Asn 
255 

Leu Asn Gly Arg 
310 

His Lys Asp Asp 
325 

Phe Gly Lys Glu 



Met He Thr Asp 

360 

Glu Arg Phe Gly 
375 

Pro Ala Thr Gly 
3S0 

Trp Gin Asp Arg 
405 



Cys Tyr Arg Gin 



Ser Asn Phe Thr 
300 

Glu Ser He He 
315 

Glu Asp Lys Phe 
330 

Ser Ala Gly Ala 
345 

Glu Glu Glu He 



Thr Val Ala Val 
380 

Asp Val His Ala 
395 

Asp Val Tyr Leu 
410 



Gin Arg Val Ser 
285 

Trp Thr Gly Ala 



Asn Pro Gly Thr 
320 

Phe Pro Met Ser 
335 

Ser Asn Thr Ala 
350 

Lys Ala Thr Asn 
365 

Asn Phe Gin Ser 



Met Gly Ala Leu 
400 

Gin Gly Pro He 
415 



Trp Ala Lys lie Pro His Thr Asp 
420 

Met Gly Gly Phe Gly Leu Lys* Asn 
435 440 



Gly His Phe His Pro Ser Pro Leu 
425 430 

Pro Pro Pro Gin He Leu He Lys 
445 
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Asn "F-hr Pro Val Pro Ala Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys 
450 455 460 

Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu 
465 t 470 475 480 

lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu 
485 490 495 

Val Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr 
500 505 510 

Val Asp Asn Asn Gly Leu Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg 
515 520 525 

Tyr Leu Thr Arg Pro Leu 
530 



<210> 18 
<211> 4681 
<212> DNA 
<213> aav-2 

<400> 18 

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120 

gccaactcca tcactagggg tccctggagg ggtggagccg tgacgtgaat tacgtcatag 180 

ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 

gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 

ggtttgaacg cgcagccgcc atgccggggt ttcacgagat tgtgattaag gtccccagcg 360 

accttgacgg gcatctgccc ggcatctccg acagctttgt gaactgggtg gccgagaagg 420 

aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 

ccgtggccga gaagctgcag cgcgactctc cgacggaatg gcgccgtgtg agtaaggccc 540 

cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 

tcgtggaaac caccggggtg aaatccatgg ttrtgggacg tttcctgagt cagattcgcg 660 

aaaaactgat tcagagaatc taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 
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tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 
ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 
agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 900 
cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 960 
cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 
aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 
atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 
tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 
ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1260 
ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1320 
ggcctgcaac taccgggaag accaacacccj cggaggccac agcccacact gtgcccttct 1380 
acgggtgcgt aaactggacc aatgagaacc ttcccttcaa cgactgtgtc gacaagatgg 1440 
tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 
tcggaggaag caaggtgcgc gtggacc3ga aacgcaagcc ctcggcccag atagacccga 1560 
ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 
ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 
tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1740 
aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 
gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1860 
agccatcgac gtcagacgcg gaagcctcga tcaactacgc agacaggtac caaaacaaat 1920 
gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1980 
atcagaattc aaatatctgc ttcacucacg gacagaaaga ctgtttagag tgctttcccg 2040 
tgtcagaatc tcaaccggtt tctgtcgcca aaaaggcgta tcagaaactg tgctacattc 2100 
atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 2160 
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tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 2220 
cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2280 
cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2340 
cttcctgggt acaagtacct cggacccttc aacggactc^ acaagggaga gccggtcaac 2400 
gaggcagacg ccgcggccct cgagcacgac aaagcctacg accggcagct cgacagcgga 2460 
gacaacccgt acctcaagta caaccacgcc gacgcggagt ttcaggagcg ccttaaagaa 2520 
gatacgtctt ttgggggcaa cctcggacga gcagtcttcc aggcgaaaaa gagggttctt 2580 
gaacctctcg gcctggttga ggaacctgtt aagacggctc cgggaaaaaa gaggccggta 2640 
gagcactctc ctgtggagcc agactcctcc tcgggaaccg gaaagccggg ccagcagcct 2700 
gcaagaaaaa gattgaattt tggtcagact ggagacgcag actcagtacc tgacccccag 2760 
cctctcggac agccaccagc agccccctct ggtctgggaa ctaatacgat ggctacaggc 2820 
agtggcgcac caatggcaga caataacgag ggcgccgacg gagtgggtaa ttcctccgga 2880 
aattggcatt gcgattccac atgqatgggc gacagagtca tcaccaccag cacccgaacc 2940 
tgggccctgc ccacctacaa caaccacctc tacaaacaaa tttccagcca atcaggagcc 3000 
tcgaacgaca atcactactt tggctacagc accccttggg ggtattttga cttcaacaga 3060 
ttccactgcc acttttcacc acgtgaccgg caaagactca tcaacaacaa ctggggattc 3120 
cgacccaaga gactcaactt caacctcttt aacactcaag tcaaagaggt cacgcagaat 3180 
gacggtacga cgacgattgc caataacctc accagcacgg ttcaggtgtt tactgactcg 3240 
gagtaccagc tcccgtacgt cctcggctcg gcgcatcaag gatgcctccc gccgttccca 3300 
gcagacgtct tcatggtgcc acagtatgga tacctcaccc tgaacaacgg gagtcaggca 3360 
gtaggacgct cttcatttta ctgcccggag tactttcctt ctcagatgct gcgtaccgga 3420 
aacaacttta ccttcagcca cacttttgag gacgctcctt tccacagcag ctacgctcac 3480 
agccagagtc tggaccgtct catgaatcct ctcatcgacc agtacctgta ttacttgagc 3540 
agaacaaaca ctccaagtgg aaccaccacg cagtcaaggc ttcagttttc tcaggcccca 3600 
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gccagtgaca ttcgggacca gtctaggaac tggcttcccg gaccctgtta ccgccagcag 3660 

cgagtatgaa agacatctgc ggataacaac aacagtgaat actcgtggac tggagctacc 3720 

aagtaccacc tcaatggcag agactctctg gtgaatccgg ggcccgccat ggcaagccac 3780 

aaggacgatg aagaaaagtt ttttcctcag agcggggttc tcatctttgg gaagcaaggc 3840 

tcagagaaaa caaatgtgaa cattgaaaag gtcatgatta cagacgaaga ggaaatccca 3900 

acaaccaatc ccgtggctac ggagcagtat ggttctgtat ctaccaacct ccagagaggc 3960 

aacagacaag cagctaccgc agatgtcaac acacaaggcg ttcttccagg catggtctgg 4020 

caggacagag atgtgtacct tcaggggccc atctgggcaa agattccaca cacggacgga 4080 

cattttcacc cctctcccct catgggtgga ttcggactta aacaccctcc tccacagatt 4140 

ctcatcaaga acaccccggt acccgcgaac cctccgacca ccttcagtgc ggcaaagttt 4200 

gcttccttca tcacacagta ctccacggga cacggtcagc gtggagatcg agtgggagct 4260 

gcagaacgaa aacagcaaac gctggaatcc cgaaattcag tacacttcca actacaacaa 4320 

gtctgttaat cgtggacttt accgtggata ctaatggcgt gtattcagag cctcgcccca 4380 

ttggcaccag atacctgact cgtaatctgt aattgcttgt taatcaataa accgtttaat 4440 

ccgtttcagt tgaactttgg tctctgcgta tttcttcctt atctagtttc catggctacg 4500 

tagataagta gcatggcggg ctaatcatta actacaagga acccctagtg atggagttgg 4560 

ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac 4620 

gcccgggctt tgccccggcg gcctcagtga gcgagcgagc gcgcagagag ggagtgggca 4680 
a 4681 



<210> 19 
<211> 4683 
<212> DNA 
<213> aav-6 



<400> 19 

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 
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cgacgcccgg gctttgcccg ggcggcc.tca gtgagcgagc gagcgcgcag agagggagtg 120 

gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 
ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 
gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 
ggtttgaacg cgcagcgcca tgccggggtt ttacgagatt gtgattaagg tccccagcga 360 
ccttgacgag catctgcccg gcatttctga cagctttgtg aactgggtgg ccgagaagga 420 
atgggagttg ccgccagatt ccgacatgga tccgaatctg attgagcagg cacccctgac 480 
cgtggccgag aagctgcagc gcgacttcct ggtccactgg cgccgcgtga gtaaggcccc 540 
ggaggccctc ttctttgttc agttcgagaa gggcgagtcc tacttccacc tccatattct 600 
ggtggagacc acgggggtca aatccatggt gctgggccgc ttcctgagtc agattagcga 660 
caagctggtg cagaccatct accgcgggat cgagccgacc ctgcccaact ggttcgcggt 720 
gaccaagacg cgtaatggcg ccggaggggg gaacaaggtg gtggacgagt gctacatccc 780 
caactacctc ctgcccaaga ctcagcccga gctgcagtgg gcgtggacta acatggagga 840 
gtatataagc gcgtgtttaa acctggccga gcgcaaacgg ctcgtggcgc acgacctgac 900 
ccacgtcagc cagacccagg agcagaacaa ggagaatctg aaccccaatt ctgacgcgcc 960 
tgtcatccgg tcaaaaacct ccgcacgcta catggagctg gtcgggtggc tggtggaccg 1020 
gggcatcacc tccgagaagc agtggatcca ggaggaccag gcctcgtaca tctccttcaa 1080 
cgccgcctcc aactcgcggt cccagatcaa ggccgctctg gacaatgccg gcaagatcat 1140 
ggcgctgacc aaatccgcgc ccgactacct ggtaggcccc gctccgcccg ccgacattaa 1200 
aaccaaccgc atttaccgca tcctggagct gaacggctac gaccctgcct acgccggctc 1260 
cgtctttctc ggctgggccc agaaaaggtt cggaaaacgc aacaccatct ggctgtttgg 1320 
gccggccacc acgggcaaga ccaacatcgc ggaagccatc gcccacgccg tgcccttcta 1380 
cggctgcgtc aactggacca acgagaactt tcccttcaac gattgcgtcg acaagatggt 1440 
gatctggtgg gaggagggca agatgacggc caaggtcgtg gagtccgcca aggccattct 1500 
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cggcggcagc aaggtgcgcg tggaccaaaa gtgcaagtcg tccgcccaga tcgatcccac . 1560 

ccccgtgatc gtcacctcca acaccaacat gtgcgccgtg attgacggga acagcaccac 1620 

cttcgagcac cagcagccgt tgcaggaccg gatgttcaaa tttgaactca cccgccgtct 1680 

ggagcatgac tttggcaagg tgacaaagca ggaagtcaaa gagttcttcc gctgggcgca 1740 

ggatcacgtg accgaggtgg cgcatgagtt ctacgtcaga aagggtggag ccaacaacag 1800 

acccgccccc gatgacgcgg ataaaagcga gcccaagcgg gcctgcccct cagtcgcgga 1860 

tccatcgacg tcagacgcgg aaggagctcc ggtggacttt gccgacaggt accaaaacaa 1920 

atgttctcgt cacgcgggca tgcttcagat gctgtttccc tgcaaaacat gcgagagaat 1980 

gaatcagaat ttcaacattt gcttcacgca cgggaccaga gactgttcag aatgtttccc 2040 

cggcgtgtca gaatctcaac cggtcgtcag aaagaggacg tatcggaaac tctgtgccat 2100 

tcatcatctg ctggggcggg ctcccgagat tgcttgctcg gcctgcgatc tggtcaacgt 2160 

ggatctggat gactgtgttc ctgagcaata aatgacttaa accaggtatg gctgccgatg 2220 

gttatcttcc agattggccc gaggacaacc tctctgaggg cattcggcag tggtgggact 2280 

tgaaacctgg agccccgaaa cccaaagcca accagcaaaa gcaggacgac ggccggggtc 2340 

tggtgcttcc tggctacaag tacctcggac ccttcaacgg actcgacaag ggggagcccg 2400 

tcaacgcggc ggatgcagcg gccctcgagc acgacaaggc ctacgaccag cagctcaaag 2460 

cgggtgacaa tccgtacctg cggtacaacc acgccgacgc cgagtttcag gagcgtctgc 2520 

aagaagatac gtcttttggg ggcaacctcg ggcgagcagt cttccaggcc aagaagaggg 2580 

ttctcgaacc ttttggtctg gttgaggaag gtgctaagac ggctcctgga aagaaacgtc 2640 

cggtagagca gtcgccacaa gagccagact cctcctcggg cattggcaag acaggccagc 2700 

agcccgctaa aaagagactc aattttggtc agactggcga ctcagagtca gtccccgacc 2760 

cacaacctct cggagaacct ccagcaaccc ccgctgctgt gggacctact acaatggctt 2820 

caggcggtgg cgcaccaatg gcagacaata acgaaggcgc cgacggagtg ggtaatgcct 2880 

caggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc accagcaccc 2940 
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gaacatgggc cttgcccacc tataacaacc acccctacaa gcaaatctcc agtgcttcaa 3000 

cgggggccag caacgacaac cactacttcg gctacagcac cccctggggg tattttgatt 3060 

tcaacagatt ccactgccat ttctcaccac gtgactggca gcgactcatc aacaacaatt 3120 

ggggattccg gcccaagaga ctcaacttca agctcttcaa catccaagtc aaggaggtca 3180 

cgacgaatga tggcgtcacg accatcgcta ataaccttac cagcacggtt caagtcttgt 3240 

cggactcgga gtaccagttc ccgtacgtcc tcggctctgc gcaccagggc tgcctccctc 3300 

cgttcccggc ggacgtgttc atgatcccgc agtacggcta cctaacgctc aacaatggca 3360 

gccaggcagt gggacgctca tccttttact gcctggaata tttcccatcg cagatgctga 3420 

gaacgggcaa taactttacc ttcagctaca ccttcgagga cgtgcctttc cacagcagct 3480 

acgcgcacag ccagagcctg gaccggctga tgaatcctct catcgaccag tacctgtatt 3540 

acctgaacag aactcacaat cagtccggaa gtgcccaaaa caaggacttg ctgtttagcc 3600 

gtgggtctcc agctggcatg tctgtccagc ccaaaaactg gctacctgga ccctgttacc 3660 

ggcagcagcg cgtttctaaa acaaaaacag acaacaacaa cagcaacttt acctggactg 3720 

gtgcttcaaa atataacctt aatgggcgtg aacctataat caaccctggc actgctatgg 3780 

cctcacacaa agacgacaaa gacaagttct ttcccatgag cggtgtcatg atttttggaa 3840 

aggagagcgc cggagcttca aacaccgcac tggacaatgt catgatcaca gacgaagagg 3900 

aaatcaaagc cactaacccc gtggccaccg aaagacttgg gactgtggca gtcaatctcc 3960 

agagcagcag cacagaccct gcgaccggag atgtgcatgt tatgggagcc ttacctggaa 4020 

tggtgtggca agacagagac gtatacctgc agggtcctat ttgggccaaa attcctcaca 4080 

cggatggaca ctttcacccg tctcctctca tgggcggctt tggacttaag cacccgcctc 4140 

ctcagatcct catcaaaaac acgcctgttc ctgcgaatcc tccggcagag ttttcggcta 4200 

caaagtttgc ttcattcatc acccagtatt ccacagqaca agtgagcgtg gagattgaat 4260 

gggagctgca gaaagaaaac agcaaacgct ggaatcccga agtgcagtat acatctaact 4320 

atgcaaaatc tgccaacgtt gatttcactg tggacaacaa tggactttat actgagcctc 4380 
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gccccattgg cacccgttac ctcacccgtc 

gttaattcgt gtcagttgaa ctttggtctc 
gcaaccggtt acacattaac . tgcttagttg 
gcccactccc tctatgcgcg ctcgctcgct 
tctgcggacc tttggtccgc aggccccacc 
caa 

<210> 20 
<211> 16 
<212> DNA 

<213> rep binding motif 
<400> 20 

gctcgctcgc tcgctg 



PCT/US99/25694 

ccctgtaatt gtgtgttaat caataaaccg 4440 

atgtccttat tatcttatct ggtcaccata 4500 
cgcttcgcga atacccctag tgatggagtt 4560 
cggtggggcc ggcagagcag agctctgccg 4620 
gagcgagcga gcgcgcatag agggagtggc 4680 

4683 
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